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PREFACE 

The Engineering Design Handbooks of the U. S. Army Materiel Command 
are a coordinated series of handbooks containing basic information and 
fundamental data useful in the design and development of Army materiel 
and systems. 

This text treats a broad class of optimal design problems through use of a 
consistent set of computational techniques ideally suited for computer 
application to mechanical design problems. No attempt has been made to be 
exhaustive in the treatment of optimization techniques or the full range of 
mechanical applications. Rather, the class of problems treated is concisely 
formulated (in Chapters 4 and following) in terms of design and state 
variables that occur in mechanical design. A steepest-descent approach — 
which has served as a workhorse, reliable technique in fields such as 
aerodynamic system design, control theory, and nonlinear programming - is 
developed here for mechanical system design. 

Extensive application of design optimization techniques is made in the 
field of structural design, as well as in a limited number of specific weapon 
design problems. The examples are presented in considerable detail, as they 
are encountered in practice, to provide the practicing engineer with insight 
into use of the methods for his class of problems. A consistent design 
philosophy is maintained throughout the text to assist the designer in 
extrapolating the methods to classes of problems that are only similar 
mathematically to the examples treated here. 

The text is structured so that it can be understood and used by practicing 
engineers with a good background in calculus and matrix theory. Computa¬ 
tional algorithms are stated in considerable detail so that they can be 
effectively implemented by junior engineers, with only problem formulation 
and general supervision provided by a senior project engineer. As with 
virtually all computer aided design techniques, some computing art is 
required for effective implementation of these techniques. The detailed 
treatment of structural applications in Chapters 5, 7, and 9 should provide 
insight into this computational art. References are given to more advanced 
literature for proofs of theorems and extensions of methods to other classes 
of problems. 
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CHAPTER 1 

ELEMENTS OF COMPUTER AIDED DESIGN 


1-1 SYNTHESIS VS ANALYSIS IN ENGI¬ 
NEERING DESIGN 

Engineering is defined (Ref. 1) as “the art 
or science of making practical application of 
the knowledge of pure sciences such as 
physics, chemistry, biology, etc.”. Although 
broad, this definition implies that the job of 
engineering is to synthesize, or put together, 
useful systems by applying knowledge and 
methods derived from the “pure” sciences. 
The meaning of “practical” in the given 
definition should be interpreted as best, or 
optimal; Le., the job of engineering design is 
to develop the best possible system for the 
given application, consistent with the re¬ 
sources allocated to the development phase. 
The purpose of this handbook is to present a 
class of methods that allow for efficient use 
of the computer in the design process. 

Since the computer can be viewed simply 
as a device to handle large quantities of data 
and perform simple algebraic operations and 
logic rapidly, it is important to look first into 
the role of calculation in design. The usual 
approach to design is to conceive of a 
candidate system and then test it to see if it 
works. Great strides have been made with 
digital computers in the past two decades to 
allow for numerical analysis as a test of the 
idea, or concept, rather than previous cut-and- 
try techniques. For example, in structural 
design one chooses the configuration and 
member sizes, and then tests the structure by 
analyzing its response to given loads. If the 
structure does not behave as desired, then de¬ 


sign changes are made and the structure is re¬ 
analyzed. This process continues until the 
designer is satisfied with his design. This has 
been the principal use of the computer in the 
design process. 

In general, then, before the designer can 
assure himself that he has the best system, he 
must be capable of analyzing all candidates. 
In the past half century, outstanding advances 
in engineering analysis have been made. The 
digital computer has allowed the engineer to 
quantitatively analyze the behavior of systems 
that were examined only qualitatively in the 
past. The mechanical sciences, particularly, 
have benefited from this boom in analysis 
capability. Structural analysis, stress analysis, 
analysis of mechanisms, and heat transfer 
analysis, just to name a few, have made 
spectacular advances in the past twenty years. 

Until the early 1960’s, and even to the 
present day to a lesser extent, the attention of 
engineering research has been focused pri¬ 
marily on developing analysis capability. Dur¬ 
ing this period of emphasis on analysis, 
inadequate attention was paid to developing a 
synthesis, or design, capability that is able to 
efficiently use the newly developed analysis 
methods. In some of the mechanical sciences, 
this problem is particularly acute. In struc¬ 
tural mechanics, for example, it is possible to 
analyze a structure under a given loading to 
obtain accurate values for stress, displace¬ 
ment, and even natural frequency. It is not 
clear, however, how a structure should be laid 
out and proportioned to efficiently utilize 
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material in order to meet strength require¬ 
ments. A more difficult problem is the pro¬ 
portioning of a structure so as to efficiently 
limit displacement and meet constraints on 
natural frequency and buckling. For a review 
of the state of optimal structural design 
through 1967, see Ref. 2. 

It appears that the analysis capability 
needed for computer aided design is available. 
The next problem to be addressed, then, is 
the matter of what is meant by best, or 
optimum. The idea of best enters very natu¬ 
rally into engineering design efforts. In 
profit-motivated industries as well as in 
Government laboratories, the objective is to 
maximize some return function while satisfy¬ 
ing constraints such as resource allocation, 
performance requirements, and human limita¬ 
tions. 

Once some return function or measure of 
value is chosen and constraints are identified, 
the system designer would like to have some 
optimal design methodology that is capable of 
aiding him in the determination of the best, 
or practically best, system. It must be empha¬ 
sized at this point that the search is not for an 
automatic optimization technique that can 
solve any design problem fed to it. Rather, 
the need is for an optimal design methodol¬ 
ogy that can aid the engineer in the imple¬ 
mentation of his concepts and guide him in a 
direction which, if continued indefinitely, 
would yield a mathematical optimum. 


A key challenge to developers of practical 
computer aids to designers is to take maxi¬ 
mum advantage of human judgment in the 
design process. The potential of interactive 
computation and design information display 
is only now in a developing stage and holds 
promise for significant improvement of the 
value of the computer in design. 

1-2 THE PHILOSOPHY OF SYSTEM ENGI¬ 
NEERING 

In the middle 1950’s a formalized approach 
to the development of large-scale, man-made 
systems began to appear in the literature, see 
Refs. 3, 4, 5. This approach, which has 
features common to most problem solving 
processes, was given the name ‘‘system engi¬ 
neering” and is the very essence of computer 
aided design. A feature which sets system 
engineering and computer aided design off 
from most of the logical problem solving 
schemes is the explicit inclusion of key 
considerations peculiar to engineering design 
of systems. A second important feature of 
system engineering is the attention paid to 
quantitative description of the system and its 
behavior. 

The basic idea in system engineering is to 
begin with a statement of system require¬ 
ments and objectives, and move in an orga¬ 
nized way toward an optimum system. A 
process which illustrates the approach is 
shown in Fig. 1-1. 



Needs and 
Objective 


Output 


Figure hi. A System Engineering Model 
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The purpose of this text is not to give a 
detailed treatment of system engineering, but 
rather to present aspects of the theory of 
computer aided design, with emphasis on 
optimal design. The simplified model of a 
system engineering process shows that opti¬ 
mal design is a part of system engineering, 
but, indeed, by no means the dominant part. 
The purpose of this paragraph is to discuss the 
interface of optimal design with the remaining 
essential elements of system engineering. 

System engineering begins with the identifi¬ 
cation of a need by a potential user of the 
system to be developed. It is often the case 
that the user knows that he needs a system to 
do a job, but he may have difficulty in stating 
his needs and objectives quantitatively. It 
then becomes the joint responsibility of the 
system engineer and user to quantify system 
objectives so that a meaningful set of objec¬ 
tives may be established for the development 
to follow. 

Once the needs and objectives for a system 
are identified, it is necessary to define func¬ 
tions that must be performed by the system 
and any subsystems that are required. This 
process is called function analysis, and its 
purpose is to pick out functions or operations 
that must be performed in order to accom¬ 
plish the mission required of the system being 
developed. These functions then become 
lower level objectives for the development of 
subsystems. Identification of functions tends 
to be qualitative in nature. However, once a 
function or operation is identified, it must be 
described in quantitative terms, if at all 
possible. For example, if a function must 
occur quickly, the time allowed should be 
speciHed. 

The next step shown in Fig. 1-1 is one of 
the most difficult functions in system engi¬ 
neering and certainly the most difficult step 


to describe analytically. Conceptual design, as 
its name implies, is the identification of the 
various concepts or basic system configura¬ 
tions that might meet the system objectives. 
It is desirable in this step to leave the 
concepts as general as possible so as not to 
eliminate candidate systems that might be 
very effective. For example, if the function to 
be performed is to propel a vehicle over the 
surface of the earth, conceptual designs might 
include wheels, tracks, legs, air cushion, etc. 

It is important at this time to identify 
ranges of values of parameters describing the 
system so that, for any parameter in this 
range of values, the system will perform the 
functions identified in the previous step, i.e., 
the set of parameters describing admissible 
systems is identified. It is at this time that the 
experienced designer can be extremely valu¬ 
able in reflecting state-of-the-art capabilities 
of technologies involved in the system devel¬ 
opment. 

The optimal design step has as its objective 
the choice of the undetermined parameters 
identified in the previous step. These param¬ 
eters must be in the ranges defined by 
technological limitations and system func¬ 
tions. The criterion for choosing system 
parameters is maximization of system worth 
or value. It should be emphasized that a 
mathematically precise optimum may be im¬ 
possible to attain and must therefore serve 
only as a goal. Methods for choosing system 
parameters should, however, have the prop¬ 
erty that if an optimum does exist, then given 
enough patience and computer time, that 
optimum should be approached as a limit. 

What appears to be the final step in the 
system engineering model of Fig. 1-1, Descrip¬ 
tion, is, in reality, probably just an inter¬ 
mediate step. Unless the system design pro¬ 
cedure has been unusually effective, the sys- 


1-3 


Digitized by L^ooQle 


AMCP 706-192 


tern decided upon will probably not satisfy 
the user. More likely, it will probably not 
satisfy the system engineering team. Having 
the results of one pass through the system 
engineering process, the user can probably 
remember some constraints which he foigot 
to specify and which the optimum system 
violates. The designer probably also will see 
concepts that he did not see before. Much as 
the user, he too will remember technological 
constraints which he forgot to specify and 
which the optimum system violates. Finally, 
the sponsoring activity will undoubtedly de¬ 
cide that it will be all right to decrease the 
measure of system value a small amount if it 
will save some money. 

The next step in the procedure is for each 
member of the team to take a deep breath, 
sigh, and go back to work, armed with his 
hard earned new knowledge. It is for this 
purpose that all the feedback paths in the 
model of Fig. 1-1 are shown. This iterative 
procedure is then continued until the sponsor¬ 
ing activity decides that the system developed 
is what it really needs. This will probably be 
another human decision, rather than a pro¬ 
grammed mathematical one. 

The remaining chapters will be devoted to 
the problem of computer aided and optimal 
design. If the design methods presented later 
are to be of maximum value to the reader, he 
must have a clear picture of how these 
methods fit into the larger problem of system 
engineering. For further literature on the 
basic ideas involved in system engineering, see 
Refs. 3, 4, and 5. 

1-3 COMPUTER AIDED DESIGN IN THE 
MECHANICAL SCIENCES 

The theory of computer aided and optimal 
design is developed in subsequent chapters as 
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it applies to the mechanical sciences. There 
are peculiarities of mechanical design, as 
opposed to classical control system design, 
which require specialized treatment. Further, 
the mathematics involved in mechanical sys¬ 
tem design is quite different from the math¬ 
ematics of control theory. These distinctions 
are highlighted throughout the text. 

In the chapters that follow, optimal control 
theory is interpreted as treating feedback 
controllers; i.e., an optimal control system has 
active elements that sense errors in output, 
due to fluctuations in inputs, and adjust 
system controls so as to maximize some 
measure of system performance. Optimal de¬ 
sign, on the other hand, is taken as the 
problem of choosing system elements or 
parameters describing these elements, which 
are fixed for the life of the elements, so that 
the system is optimum in some sense. In 
control literature this is called open loop 
control. The principal difference in the two 
problems is that the variables chosen in the 
optimal design problem are fixed for the life 
of the system, whereas variables in a feedback 
control device are to be adjusted according to 
inputs as the system operates. Mathematical¬ 
ly, the difference in the two results is that the 
control law describes how the system vari¬ 
ables should be adjusted as a function of the 
state of the system, whereas an optimum 
design is simply a set of parameters describing 
system elements and will not be changed 
during the life of the system. This distinction 
is not uniform in the control literature but is 
used here to identify the class of problems 
treated. 

In most literature on control problems, 
sequential systems are treated, i.e., operations 
of the system progress one after another as if 
they were occurring in time in a pre-arranged 
order. Many optimal design problems are not 
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of this kind. For example, in designing a 
structure one must be concerned with stresses 
due to applied loads. These stresses are 
interpreted as the state of the structural 
system. They are determined by a boundary- 
value problem that cannot be interpreted as a 
sequential process (initial-value problem). In 
some design problems it is possible to define 
auxiliary variables so that the governing equa¬ 
tions form an initial-value problem with addi¬ 
tional constraints. This procedure, however, 
generally complicates the problem unneces¬ 
sarily. For this reason the problems in 
succeeding chapters are formulated as bound¬ 
ary as opposed to initial-value problems. 

In order to illustrate the use of the meth¬ 
ods presented, applications are made pri¬ 
marily in optimal structural design. Applica¬ 
tions are chosen to illustrate the use of the 
methods on problems having a number of 
design variables which might be found in 
engineering applications. Further, since many 
of the methods are relatively new, it is 
anticipated that improvements in computa¬ 
tional efficiency may be realized in specific 
problems if advantage is taken of special 
features of the class of problems treated. 

It is appropriate to highlight a significant 
computational distinction between two 
classes of design problem. The reader may 
note that Chapters 2 through 5 of this text 
deal with problems in which system design 
and performance are specified by a finite 
number of parameters (real numbers). Chap¬ 
ters 6 through 9, on the other hand, deal with 
systems that are described by functions on 
some given space or time domain. Mathemati¬ 
cally, these problems are called finite and 
infinite dimensional, respectively. Optimiza¬ 
tion theory for these two classes of problems 
can be put in the same form, but there are 
very real differences in the computational 


techniques available for design optimization. 
Since the subject of this handbook is com¬ 
puter aids to design, the practical distinction 
is made here. For a unifying mathematical 
treatment, the reader is referred to Ref. 7. 

Finally, it is important to realize that 
engineering design optimization and engineer¬ 
ing analysis are fundamentally different in 
nature. In analysis, one is generally assured 
that a solution exists and numerical methods 
are generally stable. In optimal design, on the 
other hand, existence of even a nominal 
design satisfying objectives is not assured, 
much less existence of an optimal design. 
Moreover, even when an optimum exists, 
numerical methods for its solution are often 
quite sensitive to initial estimates and require 
much computational art for iterative con¬ 
vergence. These properties will be observed 
over and over in this handbook when example 
problems are treated. 

It is important that the reader take a 
mathematical outlook when doing computer 
aided design and optimization. A purely 
intuitive approach can lead to erroneous 
results that may not be apparent until some¬ 
one happens onto a nominal design which is 
vastly superior to a “supposed” optimum 
design. 

1-4 MATHEMATICAL PRELIMINARIES 

The level of mathematical background re¬ 
quired for an understanding of the methods 
of optimal design presented in the following 
chapters is a course in advanced calculus and 
the ability to use matrix notation. Since 
engineers often require results of rather deep 
mathematical analyses to solve real-world 
problems, several results have been accepted 
with references given to proofs. The purpose 
of this paragraph is to present notation and 
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some basic mathematical ideas used through¬ 
out the text. 


Since most real-world problems involve 
several parameters, it is convenient to utilize 
vector notation. For example, rather than 
writing x j..., jc^ repeatedly, these n variables 
are collected into a column vector 


^1 


X = 


X 


n 


(M) 


Unless otherwise noted, all vector variables 
will be column vectors. A vector of the form 
(Eq. 1-1) may be interpreted as a point in 
n-dimensional real space, . The space /?" is 
simply the collection of all n-vectors of real 
numbers. For example, the real line is R * and 
the plane isR^, 


It will often be convenient to deal with a 
collection of points in the space R”. A 
collection of points D in /?" will be called a 
set, or a subset of /?". A point jc in /?" which 
is in D will be denoted xgD. This will be the 
extent of set notation required in later chap¬ 
ters. 


In /?" there is a well defined idea of length 
of a vector. This analog of length in the real 
world will be denoted 


IUI| = 



1/2 


( 1 - 2 ) 


and is called a norm on R ”. There are many 
norms defined on /?" but Eq. 1-2 will be 
sufficient for the purposes of this text. Along 
with the idea of norm on R^ goes the concept 
of dot product or inner product. The inner 
product of two elements x and y of /?" is 

< x.y> =x^y=i xiy, . (1-3) 


1-6 


Two vectors are called orthogonal if their 
inner product is zero. 

The idea of convergence of a sequence 
{ x^hn R” with norm (Eq. 1-2) is much like 
convergence of real numbers. That is, x* = 

X if for any e > 0 there is an N > 0 such that 
II x^ - XII < € for all i > N. An important 
property of sets in optimization theory is 
closedness. A subset D of is called closed 
if every sequence in D which converges has its 
limit in D. 


Just as the idea of collecting n real numbers 
into a vector in it is helpful to define a 
vector function g(x) for xG/?" as 





(1-4) 


Such a function is called continuous at x if 
for any e > 0 there is a 6 > 0 such that ||g(x) 
-^(jc)ll;„ < € if ll'x -X 11^ < 6.The subscripts 
m and n on the norms denote the dimension 
of the space on which the norm is defined. 


It will often be desirable to deal with a set 
of functions which satisfy 


g^(x)<0,1=1, ...,m . (1-5) 


In this case it is convenient to define inequal¬ 
ity among vectors as 

gixXO (1-6) 

where inequality is taken componentwise, i.e., 
Eq. 1-6 is defined to mean the same thing as 
Eq. 1-5. 

One of the most useful notations in the 
following chapters is the idea of the derivative 
of a vector function with respect to its vector 
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variable. This notation is 



where / is a row index and / is a column index. 
If fix) is a real valued function of this 
notation is 


ax 


9/U). 


( 1 - 8 ) 


The derivative of a real valued function is 
often called the gradient of that function and 
is denoted 


+ +J_ 2 a* f(x) 

(y, -ac,)'i . ■ • (y - r )'» 


1 2 a***/(^) 

■(*+1)!/^+...+/„=*+! 


(yi-x,)'^ - [y„-xj». 


For proof of this theorem see Ref. 6, page 56. 


7/(jr)=iO£2 . (1-9) 

dx 

The gradient is one of the few standard 
symbols which denotes a row vector rather 
than a column vector. Likewise, for a real 
valued function the matrix of second deriva¬ 
tives may be defined as the matrix 



In many places in the following chapters, 
Taylor’s Theorem will be used to obtain an 
approximate expression for a function at a 
point sufficiently near a point where the 
function is known. The most common ap¬ 
proximation is the one obtained by deleting 
second and higher order terms. For example, 
if lU - y\\ is small, 

/(y)-/(jc)“^^(y-ac) (1-12) 

dx 


An important theorem in the analysis of 
functions appearing in optimal design prob¬ 
lems is Taylor’s Theorem. 

Taylor*s Theorem: Let the real valued 
function f(x) have k + 1 continuous deriva¬ 
tives in R". Then for xeR", there is a point 
£ = ax + (1 - OL)y with 0 < a < 1, such that 


/0')=/W+ 2 
/ * 


mx) 

bx, 




( 1 - 11 ) 


1 n n b^fix) 
+— 2 £ 

2 / = i /-I dx^ bxj 


(yj - Xj)(y. - Xf) 


where by Eq. 1-11 the error in Eq. 1-12 is 
at most a constant times \\y - x\\^ if/(x) has 
bounded second order derivatives. The left 
side of Eq. 1-12 is generally denoted by 
where y - x is denoted 6x. In this 
notation. 


6/U) = ^6x. (1-13) 

Eq. 1-13 may be thought of as a total 
differential. Even for vector functions g(x), 
Eq. 1-13 holds for each component so if 

= [6gi(x),..., 6g^(x)l then 
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Sg(x) = 


dg(x) 

dx 


fix 


(1-14) 


In later work, will often be a function 
of acGi?" and z&RP. In this case, Eq. 1-14 is 


hg{x,z) = 


^g(x.z) 

dx 


fix + 


dg{x,z) 

dz 


fiz 


(1-15) 


where 


hg{x,z) _ 



Most of the common notation used in later 
chapters now has been defined. Special nota¬ 
tion and results required locally for some 
development will be defined and used there. 


1-5 ILLUSTRATIVE MILITARY COM¬ 
PUTER AIDED DESIGN PROBLEMS 


In this paragraph two illustrative military 
optimal design problems are formulated, and 
computer aided design techniques are out¬ 
lined for their solution. The treatment here is 
only for the purpose of introducing concepts. 
These examples are treated in more depth in 
Chapters 7 and 8. 

1-5.1 OPTIMAL DESIGN OF STRUCTURES 


The optimization technique described in 
this paragraph was initially developed for 
application to minimum weight structural 
design problems. For this reason, and to give 
an engineering feel for application of the 
technique, the method will be presented along 
with examples from the field of optimal 
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structural design. 


As a specific example, let us consider a 
design problem whereby a highly directional 
transmission device, or perhaps a gun, is to be 
mounted on a tower or gun mount that is 
required to support the device at some given 
distance away from the basic supporting 
structure, such as the earth. A schematic of 
the problem is shown in Fig. 1-2. The basic 



Figure 1-2. Structural Requirement 


problem is to design a structure that supports 
the device under consideration and which is as 
light as possible for purposes of transporta¬ 
tion and erection on the battlefield, or per¬ 
haps mounting on a helicopter. A basic design 
requirement for this structure is that the 
device mounted on the top shall not have an 
angular deflection of more than B radians, in 
order to hit the receiver or target. The loading 
that is to be considered is a wind load of up 
to a given velocity, which would cause angular 
deflection of the top of the tower. 

The needs and objectives in this design 
problem are well established, so no additional 
inputs need be considered at the present time. 
Further, the requirement that the tower 
support the device with only a given allowable 
angular deflection is the only basic function 
required of the tower; thus the function 
analysis block of Fig. 1-1 is also complete. 
The next stage, and one that is quite difficult 
to describe analytically, is that of arriving at 
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conceptual towers which might perform the Pigs. 1-3(A) and (B), involve rigidly fixing the 

given mission. tower at its base to the fundamental support¬ 

ing structure. In both towers, variable spacing 
Four different conceptual designs are as a function of height is allowed between 

shown in Fig. 1-3. The first two concepts, vertical members of the structure. In addition. 



(Q (D) 

Figure 1-3. Conceptual Designs 


one of the concepts allows for varying the ent area and spacing as desired. Three are 

area of the main structural members as a shown for convenience in the figure, 
function of height. The second set of con¬ 
cepts, Figs. 1-3(C) and (D), involves towers In each of the conceptual towers of Fig. 

that are pinned at their base to the supporting 1 - 3 , the variables bi through describe the 

structure and that are supported by guy wires variable spacing of the members of the tower, 

at the top of the structure. Likewise, in both In two of the concepts. Figs. 1-3(B) and (D), 

of these concepts, variable spacing of the b^ through 65 specify the variable areas in 

main vertical members is allowed. In the the construction of the main vertical member, 

second concept, variation of area along the These variables serve as design parameters, in 
length of the tower is also allowed. It should that the designer can choose these variables 

be noted that the conceptual designs in Fig. and completely specify the design of the 

1-3 can have as many subsections with differ- tower. 
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In addition to the design variables, a main 
part of the design problem is the behavior of 
the structure under wind load, since one of 
the migor constraints on behavior of the 
structure is that the angular deflection of the 
top of the tower not exceed an angle 0, For 
this reason, the angular deflection of each of 
the joints must be determined, along with 
lateral deflection due to lateral wind loading. 
This is a relatively routine analysis problem 
when one uses the techniques of finite ele¬ 
ment structural analysis. Not shown in Fig. 
1-3, but required in the construction, are 
cross members which maintain spacing of the 
main vertical members. In order to state the 
optimal design problem mathematically, first 
define vectors of design variables b^ and state 
variables 

(1-16) 

Z = IZ,, Zj,...Z„J^. 

Using finite element structural analysis tech¬ 
niques, define the stiffness matrix as 

where the dependence of stiffness on the 
design variables is explicitly shown. Using this 
matrix, the structural response is given by the 
following matrix equation 

A{b)z^q (1-18) 

where q is the wind loading matrix. 

Now that the relationship between the 
design variables and the structural response is 
specified by Eq. 1-18, the next step in 
formulating an optimal design problem is the 
identification of constraints. In order to 
prevent dimensions or structural areas from 
going to zero, resulting in an unstable struc- 
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ture, it is required that the design variables be 
bounded uniformly away from zero. This is 
given formally by the inequality 

> ^/o > 0, f = 1,.. ^rn. (1-19) 

The fundamental constraint in the present 
problem is that the angular deflection at the 
top of the tower not exceed the angle 0, This 
is expressed analytically by the inequality 

|zil<0. (1-20) 

The final step in formulation of an optimal 
design problem is to identify the cost func¬ 
tion to be minimized. In the present case, the 
cost function is structural weight J and is 
given by an expression of the form 

J = c,b, (1-21) 

where y is material density and are weight¬ 
ing factors representing lengths of structural 
elements and weight requirements for lateral 
stiffners. 

We now have an optimal structural design 
problem that is well formulated from a 
mathematical point of view. The objective is 
to find the design variables bi through b^ 
that satisfy constraint Eqs. 1-19 and 1-20, and 
which minimize the structural weight as given 
by Eq. 1-21. The technique used to solve this 
problem, and in fact a large class of optimal 
system design problems, is based on a very 
simple idea of engineering design. The idea of 
the technique is to allow small variations in 
some nominal design, and analyze the effect 
of these variations on the equations of the 
problem and the cost function associated with 
the problem. As a result of allowing only 
small design changes, certain approximations 


Digitized by L^ooQle 



AMCP 706-192 


may be made that allow the best change in 
design variables to be determined in order to 
decrease the cost function of the problem as 
much as possible, while still not violating 
constraints of the design problem. For 
example, one might choose as an initial 
estimate of the optimal design a uniform 
tower as shown in Fig. 1-4. The estimated 
design variable in this case is denoted by 



Figure 1-4. Uniform Initial Design 

Let hb be a small change in the design 
variable Any change in the design 

variable will result in a change in the struc¬ 
tural response, denoted by 5z. The nature of 
the structural analysis problem guarantees 
that small bb yields small bz. Further, a 
Taylor series approximation of terms appear¬ 
ing in Eq. 1-18 yields 

+4-{Aib)z\ (1-22) 

Ob b^b^ ^ 

If an inequality constraint is violated, such 
as 

b^<b^^ (1-23) 

then in order to correct the constraint error it 
is required that 


^bi> bio-b^, (1-24) 

Or, if the angular deflection constraint is 
violated, for example, 

z^> e (1-25) 

then, to correct the constraint error it is 
required that 

6 z, (1-26) 

Finally, the change in structural weight due to 
the change in design bb is given by 

5/ = 7 2 c.bb,. (1-27) 

The object of the new problem is to 
determine bb so as to minimize the linearized 
cost function of Eq. 1-27, subject to con¬ 
straint Eqs. 1-24 and 1-26. Due to the special 
nature of this problem, the optimum change 
bb can be determined in closed form. For a 
detailed derivation of this optimum perturba¬ 
tion, the reader is referred to Chapter 5. For 
discussion here, the results of this calculation 
will be denoted by 

bb^rtB^C (1-28) 

where the vectors B and C depend on 
constraint errors, and equations of the prob¬ 
lem. The parameter is an undetermined 
parameter that plays the role of a step size, 
when viewed in the geometry of design 
variable space. For example, if there are only 
two design parameters bi and 62 , the direc¬ 
tion of the desired change is shown by ^ in 
Fig. 1-5, and r? is a step size along that 
direction. In the terminology of optimization 
theory, B is known as the direction of 
steepest descent. It is analogous to the direc¬ 
tion one would go downhill in order to reduce 

1-11 


Digitized by L^ooQle 


A 



AMCP70e-192 



his altitude as rapidly as possible. It is clear 
that on normal hills, as in most design cost 
functions, the direction of steepest descent 
changes, depending on the location on that 
hill. For this reason, the direction of steepest- 
descent does not generally pass through the 
optimum point as shown in Fig. 1-5. 

There are many techniques for choosing 
the step size 17 . The one used in the steepest 



descent method is based on requesting a 
certain reduction in the cost function due to 
the changed 6b. This request, then, deter¬ 
mines the step size and one can calculate 6b 
from Eq. 1-28. This 6b is the best change in 
the estimated design variable This best 
change is then added to the initial estimate to 
obtain a new estimate that corresponds to a 
structure of less weight and that still satisfies 
the constraints of the problem, i.e., 

= ft(0)+ 6 fc. (1-29) 

This process is repeated as many times as 
required to obtain convergence to the mini¬ 
mum weight structure. 

The optimum towers for each of the four 
basic configurations chosen are shown in Figs. 
1-6 and 1-7, with a table of results being given 
in Table 1-1. These results were obtained 



Figure 1-6. Tower With Base Rigidly Fastened to the Earth 
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using a finite element model with approxi¬ 
mately forty elements so that the resulting 
structure has an essentially continuous distri¬ 
bution of material and spacing. The weights 
shown in Table 1-1, corresponding to no 
design variables are simply the weights of the 
optimum towers having uniform members and 
no variation in spacing. Note that there is a 
significant reduction in structural weight for 
the tapered optimum towers over uniform 
towers. Extensive examples of this kind are 
presented in Chapters 5, 7, and 9. 


1-52 APPLICATION OF THE STEEPEST 
DESCENT METHOD IN INTERAC¬ 
TIVE COMPUTER AIDED DESIGN 




(A) One Design (B) Two Design 

Variable Variable 

Figure 1-7. Toy^r With Base Simply Supported 
and Top Supported WM Guy Lines 


Very often in design problems, it is not 
practical to specify a unique cost function to 
be minimized, hence the formal optimization 
problem described in par. 1-5.1 does not 
apply directly. The fact that the vector B in 
Eq. 1-28 is a direction of steepest descent, 
however, is extremely valuable information to 
a designer. The application of this informa¬ 
tion to a structural design problem, using 
interactive graphics, is a technique which 
shows considerable promise in design. 


Consider, for example, the problem treated 
in par. 1-5.1. The initial estimate of the 
optimum tower was taken as a uniform tower. 
The components of the vector hb can be 
projected on a cathode ray tube, along with a 
picture of the structure as shown in Fig. 1-8. 
The algebraic sign of the components of 6b, 
corre^onding to each of the design variables, 
is an indication of the effect a change in that 
design variable will have on the cost function 


TABLE 1-1 

WEIGHTS OF TOWERS 






Guy-line 

Guy-line 

Guy-line 


Cantilevered 

Cantilevered 

Cantilevered 

Supported 

Supported 

Supported 

Number of 

Design 

Variables 

0 

1 

2 

0 

1 

2 

Best Weight 

W » 2440.6 lb 

W = 2111.4 

W= 1827.9 

W « 1563.99 

W= 1366.6 

W» 1265.71 

Height 
Cross-sec¬ 
tional area of 

h ** 63.7 in. 

hmax'S’--* 

►’max = 80.2 

h = 46 

*’max = «6.6 

hmax = 36-55 

member 

A = 7.96 lb 

A-637 

Amax-10.03 

A = 3.84 

A - 4.434 

Amax = «-98 
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Figure 1-8. Sensitivity to Design Variations 

of interest. For example, if bb^ were positive, 
this would indicate that an increase in the 
dimension will decrease the structural 
weight. On the other hand if the algebraic sign 
of bbi were negative, then an increase in bbi 
would increase the structural weight. Like¬ 
wise, the algebraic signs of 664 through 
bb^ indicate the effect that a change in these 
element areas will have on structural weight. 
These data give the designer valuable informa¬ 
tion, according to which he should change his 
nominal design to improve the structure, 
while still satisfying all the essential con¬ 
straints. 

Traditionally, in structural design by graph¬ 
ics, the designer puts areas and dimensions 
into a structural analysis routine and then 
requests a stress calculation, the results of 
which are shown on the screen of a cathode 
ray tube. This technique has been used by 
Lockheed-Georgia in the design of the C5A. 
While this technique has been quite useful in 
structural design, it is extremely difficult for 
the designer with only stress information to 
determine how he should change just one 
element in the structure to reduce overall 
structural weight. The difficulty comes in the 
interplay between structural constraints. If, 

1-14 


on the other hand, the designer has trend 
information that he can use in altering the 
distribution of material in a structure, he can 
better use his experience in making design 
improvements. This capability can be invalu¬ 
able to large-scale structural designers. It 
includes the effect of individual design vari¬ 
able changes on overall structural value, while 
taking into account the effect of that design 
change on all design constraints. 

In real-world structural design problems, 
the designer must design his structure for 
more than simply light weight. He must be 
concerned with structural vibration and 
buckling characteristics, since these are m^or 
sources of structural failure. Often, as in par. 
1-5.1, it is possible to determine design 
perturbations that have a desirable effect on 
such structural properties as natural fre¬ 
quency and weight simultaneously. Both of 
these factors can then be displayed on a 
cathode ray tube as shown in Fig. 1-9. In this 
case, bb^ indicates the direction in which the 
design variable should be changed to reduce 
structural weight, and bb^ indicates the direc¬ 
tion in which the variable should be changed 
to increase natural frequency. This informa¬ 
tion can then be used by experienced design 
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Figure 1-9, Sensitivity to Two Performance 
Indicators 
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personnel in making design changes that wiU 
have desirable effects on overall aircraft struc¬ 
tural properties, for example. This is extreme¬ 
ly important in large-scale structural design 
due to the difficulty in determining the effect 
of changes in an individual design parameter 
on several different structural properties. 
Computation of these data and interactive 
aspects of the technique are discussed in 
Chapter 5. 

This design technique is feasible from a 
computational point of view in that very little 
additional computer time is required to 
generate sensitivity information from stress 
and vibration analyses that are required. While 
most structural optimization work has been 
done in the batch mode, it is shown in 
Chapter 5 that utilization of the steepest- 
descent technique with interactive graphics is 
a much more practical way to design struc¬ 
tures, particularly in cases where several 
measures of structural performance are im¬ 
portant. 

Development and display of sensitivity 
information in design is a form of information 
transfer to design personnel. This technique 
depends on the availability of interactive 
graphics software and hardware, which are 
currently being developed. 

1-6.3 DESIGN OF ARTILLERY RECOIL 
MECHANISMS 

As an application of this same optimization 
technique to a weapon design problem, cer¬ 
tain aspects of the design of a lightweight 
artillery piece will now be outlined. The 
requirement was stated for a lightweight 
artillery piece that can be fired with very 
short implacement time. For this reason it 
was determined that the weapon must be 
capable of being fired while it is resting on its 


tires. A photograph of the first prototype of 
this weapon is shown in Fig. 1-10. 

The recoil mechanism for this weapon was 
designed according to traditional recoil mech¬ 
anism design goals. Namely, the objective in 
the design was for a constant retarding force 
which is transmitted by the recoil mechanism 
to the undercarriage, as shown in Fig. 1-11. A 
recoil mechanism was designed which de- 
livo-ed approximately this recoil force/?(r) as 
a function of time. 

When the weapon was built and fired, a 
nearly constant recoil occurred, as desired; 
but, at high angles of fire, the weapon 
exhibited unacceptable dynamic response. 
During firing, the tires of the weapon com¬ 
pressed and after firing and the subsequent 
release of the recoil forces, the weapon 
rebounded off the ground approximately 6 in. 
This unacceptable behavior required a re¬ 
design cycle for the recoil mechanism with a 
design goal of minimizing the dynamic re¬ 
sponse, or hop, of the weapon after firing. 

It was determined that the peak recoil 
force could be allowed to reach 22,000 lb 
without damaging the support structure. The 
optimization problem is then to determine 
the recoil force R(t) as a function time such 
that 

/?(/)< 22,000 (1-30) 

and the peak dynamic response, denoted by 

/= max lh(t)] (1-31) 

i 

is as small as possible, where /i(0 is the height 
of the tires off the ground at any time r. 
Graphically, this problem is to determine a 
recoil force which lies beneath the 22,000-lb 
level in Fig. 1-12, and which minimizes the 
peak dynamic response of the weapon. In this 
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Figure 1-10. Howitzer, Towed, 105 mm, XM164 


problem, the dynamic response h{t) is deter¬ 
mined by the second order differential equa¬ 
tions of motion of the artillery piece. 



Time, tec 

Figure /- 11. Traditional Recoil Design Goat 
1-16 


The same philosophy of small design 
changes about some nominal estimate, as in 
the structural design problem of par. 1-5.1, 
was employed in this case. A sensitivity 



Hme, sec 

Figure hi2. Recoil Distribution in Time 
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function is first determined, which indicates 
the desirable direction of change in the 
nominal design variable. For example, taking 
the previously designed constant retarding 
force as the nominal base line, a sensitivity 
function is determined as shown in Fig. 1-13. 
If a constant multiple of this function is 
added to the retarding force, a reduction will 
occur in peak dynamic response and other 
constraints of the problem will continue to be 
satisfied. The dotted curve in Fig. 1-13 shows 
the altered design, which gives better charac¬ 
teristics than the original design estimate. 


22,000 

20,000 




Figure hi3. Sensitivity to Gun Hop 



Figure hi4. Optimum Recoil Curve 


This sensitivity information could easily be 
displayed on the screen of a cathode ray tube 
and could be used by design personnel in 
determining desirable changes in the recoil 
design. Even in this relatively simple problem 
it was not clear in what way the design should 
be altered to obtain improved response of the 
artillery piece. This particular problem was 
solved in the batch mode by doing many 
small step iterations of the kind previously 
described until convergence to an optimum 
was obtained. The optimum recoil force curve 
is shown in Fig. 1-14 and resulted in a peak 
dynamic response of less than 0.5 in. Detailed 
solution of this problem is presented in 
Chapter 8, par. 8-5. 
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CHAPTER 2 

FINITE DIMENSIONAL UNCONSTRAINED OPTIMIZATION 


2-1 INTRODUCTION 

In many engineering design problems cer¬ 
tain information which helps to prescribe the 
object being designed is specified. However, a 
certain number of parameters called design 
parameters are left open to the designer’s 
choice. These parameters must uniquely 
determine the object if the optimal design 
problem is to be meaningful. In the discussion 
which follows, the design parameters will be 
denoted by jC|...x^ or in vector notation 
simply as X =(x,.x„)^. 

In virtually all design problems there are 
restrictions on the object being designed. 
These may include the performance required, 
physical limitations such as size, weight, 
resource limitations, and organizational poli¬ 
cy. These restrictions or constraints generally 
will involve the design parameters so that the 
range of values of design parameters may be 
restricted. If the vector of design parameters 
(hereafter called the design parameter) is 
viewed as an element of real Euclidean space 
/?", then the effect of the listed restrictions is 
to confine the designer’s choice of design 
parameters to a subset D of called the 
admissible set of design parameters. The 
nature of this set will be determined by the 
nature of the requirements placed on the 
system being designed. This aspect of the 
optimal design problem will be treated ex¬ 
tensively in later chapters. 

When one speaks of optimal design, he 


must be able to choose, out of a collection of 
objects which satisfy the restrictions of the 
preceding paragraph, that one which is 
“best”. More specifically, out of all design 
parameters in the admissible set D, the de¬ 
signer must pick that one, x,which describes 
the “best” system. This discussion has still 
not given the meaning of “best”. An effective 
way of defining “best” is to give a real valued 
function whose domain of definition is the 
admissible set D, say f{x). “Best”, then, may 
be taken as the minimum or maximum of f(x) 
for X in D, If the function/(x) is a cost of the 
system being designed, then it is to be 
minimized. If, on the other hand, /(x) is a 
return or profit, it is to be maximized. 

The cost or return function will be defined 
in each optimal design problem. As a result, 
very little can be said about its nature in 
general. It is clear, however, that maximizing 
a real valued function r(x) is equivalent to 
minimizing - r{x). Therefore, optimal design 
problems may always be put into a form 
which may be interpreted as minimization of 
a cost function. For convenience this will be 
done in the following development. 

Example 2-1: As a hypothetical optimal 
design problem let the scalar x be the design 
parameter and f{x) = (x - 2)^ be the cost 
function. In Fig. 2-1 the cost function is 
plotted versus x. It is clear that the minimum 
cost of zero occurs at x = 2. 

Example 2-1 is included here as an aid to 
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intuition in more complex problems. Even 
when X is an n-vector, one can think of 

plotting the cost function above the x^yper- 
plane to obtain the cost surface. The optimal 
design problem is then to find the lowest 
point on this surface. 

Even though real-world optimal design 
problems invariably have constraints placed 
on the design parameter, the methods pre¬ 
sented in this chapter will ignore constraints. 
There are two reasons for considering this 
simplified problem in some detail. First, it 
may happen that the design parameter x that 
minimizes f{x) lies in the interior of the 
admissible set D, In this case the constraints 
play no part in locating 3c. Second, even 
though the point 3c may push some constraint 
to its limit and lie on the boundary of D, 
there are iterative methods for finding 3c 
which require minimization of an auxiliary 
cost function, subject to no constraints at 
each iteration. Methods which take con¬ 
straints into account are presented in Chap¬ 
ters 3 and 4. 

Two basically different methods of solving 
unconstrained minimization problems are pre¬ 
sented in this chapter. The first method, 
called the indirect method, is based on de¬ 
rived properties of the cost function at its 
minimum; i.e., if one pictures himself as being 
at the lowest point of the cost surface (x = 2 
in Fig. 2-1), he may notice that the surface is 
required to have certain special properties 
there. He may then use these special prop¬ 
erties to locate the lowest point on any such 
surface. This intuitive idea is made rigorous in 
par. 2-2. 

The second method of solving optimization 
problems is more direct in nature and is 
appealing from an engineering point of view. 
The designer initially chooses a design param- 



Figure2-1. ffx)=(x - 2P 


eter which is admissible, say x^®\ This choice 
of design parameter will probably not put him 
at the lowest point on the cost surface. 
Rather than discarding this nonoptimal point 
and picking another trial point at random he 
might attempt to find a second point x^^^ 
which is closer to the lowest point of the cost 
surface. The designer’s view of the cost 
surface is limited to only a small area due to 
the local nature of mathematical tests which 
he may perform. Using only this local 
information, he chooses a strategy which 
insures that he makes a move to a new point 
x^‘^ which is lower than x^*^. The direct 
methods presented in pars. 2-3 to 2-7 are just 
a mathematical implementation of these 
elementary ideas. 

2-2 NECESSARY CONDITIONS FOR EX¬ 
TREMA 

As described in par. 2-1, the approach 
taken in the indirect method is to assume/(x) 
has a minimum at 3c and then derive condi¬ 
tions which f(x) must satisfy there. These 
conditions may then be used to find the 
minimum point of any real valued function 
/tx). They are valuable in giving the designer 
an insight into the minimization portion of an 
optimal design problem, even when he is using 
direct computational methods to solve the 
problem. Before these ideas may be devel¬ 
oped, several definitions are required. 
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Definition 2-1 : A real valued function f{x) 
defined on a subset D of has an absolute 
minimum at jc in Z) if 

m<fix) ( 2 - 1 ) 

for all X in D. The function gix) has an 
absolute maximum at 3c if - g(x) has an 
absolute minimum there. The minimum is 
called strict if only strict inequalities hold in 
Eq. 2-1 forx # J. 

Note that f{x) can have a strict absolute 
minimum at only one point in D whereas it 
could have an absolute minimum at several 
distinct points in D provided it has the same 
value at all these points. 

Definition 2-2: A function f{x) defined on 
a subset D of has a relative minimum 
(maximum) at 3c if there exists an e > 0 so 
that f{x) has an absolute minimum (maxi¬ 
mum) in a subset of D whose points satisfy 

|x^ -3cJ < e, i= 1,..., n. 

Verbally, this definition says that/(x) has a 
relative minimum at 3c if it has an absolute 
minimum in a sufficiently small neighborhood 
of 3c. It is clear that if f{x) has an absolute 
minimum at 3c, then it also has a relative 
minimum there. The converse is not neces¬ 
sarily true. 

Example 2-2: Locate all relative and abso¬ 
lute maxima and minima of f{x) on 0 < x < 
3, where f[x) is given graphically in Fig. 2-2. 

The function /(x) has a strict absolute 
maximum at x = 1, absolute minima (not 
strict) at X = 0 and 2, relative maxima at x = 1 
and 3, and relative minima at x = 0 and 2. 

In Definitions 2-1 and 2-2 no continuity or 
differentiability requirements were placed on 



Figure 2-2 A Cost Function 

fix). Without making some assumptions as to 
the r^larity of fix) it is difficult to verify 
the required inequalities. Consider the case of 
a function fix) of the real variable x which 
has two continuous derivatives. The Taylor 
formula is 

/(3c + /i)=/(3F)+/'(3c)/i 

'^^f'\x^eh)h^ ( 2 - 2 ) 

where 0 < 0 < 1. Since /"(3c + Oh) is bounded 
for /t in a closed bounded set, it is clear that if 
/'(3c) #0 then for small enough h the linear 
term in h dominates the squared term so that 
fix 4- h) may be made both larger and smaller 
than /(3c) through choice of the appropriate 
sign of h. Therefore, in order for/(x) to have 
a relative minimum or maximum at 3c it is 
necessary that /'(3c) = 0. It foUows directly 
from Eq. 2-2 that if /'(3c) = 0, then/"(3c) > 0 
(< 0) is a sufficient condition for fix) to have 
a relative minimum (maximum) at 3c. 

In case x is in results analogous to 
those just obtained are given in Theorem 2-1. 

Theorem 2-1: Necessary Condition: Let 
fix) be defined on a subset D of R^ and have 
a continuous derivative in a neighborhood of 
a point 3c which is in the interior of D, If/(x) 
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has a relative minimum at x then 
V/i(J) = 0. (2-3) 


Sufficient Condition: Let Ax) have two coi^ 
tinuous derivatives in a neighborhood of x 
and let Eq. 2-3 hold. Then if the matrix 



(2-4) 


is positive definite, Ax) has a relative mini¬ 
mum at 3c. 


For convenience in later discussions. Defi¬ 
nition 2-3 is made. 


Definition 2-3: A point at which Eq. 2-3 
holds is called a stationary point of/(x). 


It is imperative that the reader be aware of 
the hypothesis of Theorem 2-1 which requires 
3c to be in the interior of the region D. The 
theorem does not apply if x is on the 
boundary of D. Example 2-2 illustrates this 
requirement graphically. Points x = 0 and x = 
3 of Fig. 2-2 yield a relative minimum and a 
relative maximum, respectively, but neither 
point is stationary (i.e., neither satisfies Eq. 
2-3). The same example also illustrates the 
need for verification of the differentiability 
properties of /(x). Even though x = 1 yields 
an absolute maximum of Ax) and is in the 
interior of D, it is not a stationary point since 
Ax) does not have a continuous derivative 
there. This example illustrates the need to 
faithfully verify all the hypotheses before 
Theorem 2-1 is employed. 

In order to verify the sufficiency condition 
of Theorem 2-1, one must have a verifiable 
test for positive definiteness of a matrix. 
Probably the most useful test is the following 
(Ref. 2, page 103): A symmetric matrix 


2-4 


A= ((^if)nxn positive definite if and 
only if the determinate of each of the 
matrices A , formed from the first m rows 

m 

and first m columns of >4, is positive, m = 
l,...,n. 


Example 2-3: Obtain explicit necessary and 
sufficient conditions for /(X 1 .X 2 ) to be a 
minimum and a maximum at 3c. where/{xi, 
X 2 ) has two continuous derivatives in D and x 
is an interior point of D. 

As necessary conditions for either a mini¬ 
mum or a maximum, Eq. 2-3 demands 

A sufficient condition for x to be a 
minimum point for /fx) is that in addition to 
the above equations, the matrices 

have positive determinates, i.e., 


xx 




The function /[x) has a relative maximum 
at X if the function j(x) = - /fx) has a relative 
minimum there. Therefore, in addition to 
_y = (3f)=0 sufficient conditions 

for*i(x) to have a relative minimum at x are 


For a relative maximum of /(x) at x then 
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sufficient conditions are 
/r r (^<0and4 

-[/, ^ (3?)]" > 0 . 

^ ix 2 


(^fx. 


(X) 


Thus far in this paragraph only properties 
of fix) precisely at the minimum point have 
been investigated. If the designer viewed the 
graph of fix) versus x to be a surface, then 
Theorem 2-1 tells him what the surface will 
look like when he finds its lowest point. 
Theorem 2-1, however, does not tell him that 
a lowest point exists. In order to solve his 
optimization problems, the designer would 
like to have tools which allow him to stand 
back from the cost surface and learn some¬ 
thing about its global properties. Two theo¬ 
rems are now stated which give him a better 
overall view of the optimization problem. 

Theorem 2-2: If fix) is continuous on a 
closed and bounded subset D of y then/(x) 
has an absolute minimum in D. 


< 1, is also in D. A real valued function fix) 
defined on a convex set D is called a convex 
function onZ) if for any two points y and z in 
D 


f[y^eiz-y)]< fly) + e Ulz) -/(y)], 
0 < 0 < 1 . 

That is, fix) is convex on D if the straight 
line segment fly) + B[flz) - fly)] is above the 
graph of fix) on the line segment y + ^(z - y) 
in Z), 0 < 0 < 1. For a more detailed 
discussion of convex functions, see Appendix 
A. 


Theorem 2-3 gives the designer valuable 
information about the global properties of the 
cost function. It is proved in detail in Chapter 
4. 


Theorem 2-3: Let fix) be a convex func¬ 
tion defined on a convex set D in . Then a 
relative minimum of fix) on D is also an 
absolute minimum of fix) onD. 


This theorem does not hold, in general, if 
any of the hypotheses are deleted. For ex¬ 
ample, consider the function fix) -xonD- 
(x| 0 < X < 1). D is not closed and/(x) does 
not have an absolute minimum in D. If 
5={x|0<x< l} then 5 is closed and fix) 
has an absolute minimum at x = 0. 

Note: The hypotheses of Theorem 2-2 may be 
weakened by demanding that fix) be only 
lower semi-continuous rather than continu¬ 
ous. For proof, see Ref. 1, page 58. 

Theorem 2-3 depends on the concept of 
convexity. 

Definition 2-4: A subset D of is called a 
convex set if whenever x and y are in Dy then 
the straight line segment x + 0(y - x), 0 < 0 

► 


This theorem is of obvious value to the 
designer. It assures him that if his design 
problem satisfies the hypotheses of Theorem 
2-3 and if he has found a relative minimum 
then he is through; he has also found the 
absolute minimum. 

Computational methods for finding ex¬ 
trema based on the theorems of this para¬ 
graph generally involve the solution of non¬ 
linear algebraic equations. In particular, Eq. 
2 -3, which is in general nonlinear, can be 
solved by a numerical method to locate all 
admissible interior extrema. Methods for solv¬ 
ing such equations are given in Ref. 3, 
Chapter 2. It generally has been found, 
however, that direct methods for finding 
extrema are superior to the solution of Eq. 
2-3. For this reason no computational 
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methods based on the indirect method will be 
presented here. 

It will be the purpose of the remainder of 
this chapter to present methods that the 
designer may use to locate interior relative 
minima. Relative minima on the boundary of 
the admissible region will be treated in Chap¬ 
ters 3 and 4. 

2-3 ONE-DIMENSIONAL MINIMIZATION 

In the direct minimization methods to 
follow, a multidimensional minimization 
problem will be reduced to a sequence of one¬ 
dimensional minimization problems; i.e., the 
problem of determining a scalar a so that a 
given function g(a) will be a minimum. 

In the problem of minimizing f[x) for x in 

, all the methods of solution presented in 
this chapter are based on successive improve¬ 
ments in certain directions; i.e., at a point x^‘^ 
one finds a direction, 5, in which fix) de¬ 
creases. The object is now to move along the 
vector x^^^ + 05 , by adjusting a, a > 0 , until 
fix) is as small as possible. The resulting point 
is then called jc^' ^ ^ and the entire process is 
repeated. It is clear that the intermediate 
problem of determining a so as to minimize 
f{x^^^ + as) is one-dimensional. This paragraph 
will be devoted to presentation of methods 
for solving the one-dimensional problem. 

2-3.1 QUADRATIC INTERPOLATION 

If the function f{x^‘^ + as) of the scalar 
variable a - and the unit vector s are 
fixed - were quadratic in a, then the value of 
a which minimizes the function could be 
found by setting 

£-[/(x<'>+ai)j=0. 

2-6 


The object here is to treat more general 
functions, but it is possible to make a 
quadratic approximation to f [x^^ + osl 
which will hold near the minimum point. 
Then, the minimum point of the approximat¬ 
ing function, which may be easily found, is an 
approximation of the true minimum point. 

The quadratic approximation of f + 
as] is constructed by passing a quadratic 
curve in a through three computed values of 
the function. The distance between the three 
trial points will be 6 > 0 , where 6 is initially 
chosen to be a small fraction of the expected 
range of a. It is known, however, that if the 
starting point of the process is quite far from 
the minimum point then the minimum point 
of the approximating function may not be 
near the true minimum point. To prevent 
making large, inaccurate steps in this case, a 
maximum allowable step size A is chosen 
before the process begins. A reasonable choice 
of A is 50% of the expected range of a. 

The following algorithm implements the 
procedure described: 

Step 1. Define a® = 0 and / = 1. 

Step 2. Compute 

/, =/[x<'>+(0(/-* -6)5 
/o =/[x<« +a^‘ 5j 


f2 +(a^'‘ +6)5). 


Step 3. A quadratic polynominal in a - 
o(^'* = z is fitted through (— 6 , /i), 
(0, fo), ( 6 . /j)- Its minimum is 


+ /2 # 0 . If this quantity is zero, 
then the approximation is a 
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straight line with minimum at = 
± A, depending on which of /j and 
/2 is smaller. 

Step 4. Define 

dot = min (|z^ l,A)sgn(z^) 
and 

Of/ = + f/a 

Step 5. If \da\ is less than a specified 
tolerance, the process is stopped 
and oJ is taken as the minimum 
value of ot. Otherwise, replace / 
with / + 1 and return to Step 2. 

2-3.2 FIBONACCI SEARCH (OR GOLDEN 
SECTION SEARCH) 

The Fibonacci search technique is a 
method based on isolating a relative minimum 
in an interval and successively decreasing the 
size of the interval. The process thus gives 
successively better estimates for the location 
of the minimum point. For a proof that the 
method converges very rapidly the reader is 
referred to Ref. 4, page 236. Here, only the 
basic ideas behind the method will be given, 
and an iterative algorithm stated. 

Starting at a = 0 one might evaluate f[x^^ 
+ 05 ] at o = 6 and check to see if the 
functional value is smaller than at o = 0. If it 
is, one might then evaluate the function at o = 
26 and compare with the value of o = 6. 
Again if a decrease occurs, one moves on to o 
= 36, etc. The process will terminate when 
/[jc^^+(k + l)65] >/[x<^+k65 ]. It is 
then known that (Jk - 1)6 < o < (k + 1)6 
contains the minimum point and a more 
accurate result, if required, may be obtained 
by reducing 6 and repeating the process from 


01 - (Jk — 1)6. If the initial step 6 was too 
small, many steps will have to be made before 
the minimum point is located. 

In Fibonacci search the same basic proce¬ 
dure is followed except that if, after a given 
step, the functional value has decreased, then 
the next step size is taken as 1.618 times the 
previous step size. In this way if the minimum 
point is a long way from a = 0, the Fibonacci 
technique wiU isolate it much more rapidly 
than the previous method with constant step 
size. Note that there is a penalty, in that the 
interval which contains the minimum point 
may have length much greater than 26. This is 
illustrated in Fig. 2-3. 



Figure 2’3. Function of Single Variable 

Once the minimum point is restricted to 
some interval, this interval is broken up into 
three subintervals by inserting points located 
a distance of 0.382 times the length of the 
interval from each end. A test is then per¬ 
formed to see which subinterval the minimum 
point lies in. For a given subinterval the 
partitioning is shown in Fig. 2-4. 

0.382 0.382 (a^- a^) 


f-1-1-1 

^ % % «u 

Figure 2-4, Interval Partition 
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The search process is terminated when the 
minimum point is isolated in a sufficiently 
small subinterval. 

The Fibonacci search method has the prop¬ 
erty of being best in a certain sense among all 
search techniques which isolate a in an 
interval. A measure of the effectiveness of any 
such technique is the ratio of the length of 
the largest interval in which ot may lie after n 
steps to the length of the original interval 
which contained a. It is shown in Ref. 4, page 
253, that if f + ocs] has a unique relative 

minimum as a function of a, then Fibonacci 
search minimizes the number of interval 
partitions. 

The Fibonacci search technique may now 
be given in the form of a computational 
algorithm: 

Step 1. First an upper bound must be 
found for a, . It is clear that 0 is 
a lower bound, ag. For a chosen 
small step size 5 in a, let / be the 
smallest integer such that 

/ + [ i 6(1.618)*]i) 

y *-o I 

>/|x<'> + [V 5(1.618)*)5| . 

I it-0 f 

Then upper and lower bounds on are 

a = S 6(1.618)^ 
k"0 

h2 

a = 2 6(1.618)^. 

“ it-o 

Step 2. Compute/ where 

+0.382(a^ -ttg) 

ocj, =ag +0.618(a^ - a^). 


/-I 

Note that = 2 6(1.618)* so /[jc<^ + 
a^5] is known. 

Step 3. Compare / and / 

+ and go to Step 4, 5, or 6. 

Step 4. If / + a^5] < / + a^s I, 

then ttg < By the choice 

of and ot ^, the new points = 
and have = a^. 

Compute now / 
where a' = aj + 0.382 (a^ - a'). 
Go to Step 7. 

steps. If/[x^^+a^5] >/[x^^+a^5j, 
then a < Similar to the 

m If 

procedure in Step 4, put a' = 
and so that a' “ 

Compute flx^^ + a^ 5 ] where = 
<+0.618(a; 

Go to Step 7. 

Step 6. If / = / [x^^ , 

puta;=a^anda;,=a^. 

Return to Step 2. 

Step 7. If - a' is suitably small, put 

0^^^ “ + CTg) and stop. 

Otherwise, delete the primes on a', 

, and and return to Step 

3. 

2-4 THE METHOD OF STEEPEST DE¬ 
SCENT (OR GRADIENT) 

The simplest and probably the best known 
of the direct methods of minimization is the 
Method of Steepest Descent (or Gradient). 
This method is based on the fact that if the 
cost surface is smooth, then its tangent plane 
is a good approximation to the surface near 
the point of tangency. The philosophy of the 
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Method of Steepest Descent is apparent in its 
title. One wishes to change by an incre¬ 
ment dx in such a way that /(x), x = x^^^ + 
dx, is reduced as much as possible for a given 
length of increment. The direction of the 
increment dx is called the direction of steep¬ 
est descent. 

The direction of steepest descent is given 
by Theorem 2-4. 

Theorem 2-4: Lctf{x) be differentiable in 
/?". The direction of steepest descent at a 
point 3c is 

dx = -aVf'^Cx) (2-4) 

where a > 0 is a scalar factor. 

The proof of Theorem 2-4 illustrates clear¬ 
ly that the direction of steepest ascent is 

dx = aVf^(x) (2-5) 

for a > 0. The reader should note carefully 
that Eqs. 2-4 and 2-5 give only the direction 
in the design parameter space /?" which yields 
the maximum rate of change of f[x). Since 
the factor a is not determined explicitly, the 
size of step is not specified. 

In order to start the steepest descent 
iterative technique, the designer makes the 
best estimate of the design parameter avail¬ 
able, The gradient is then 

computed at and a new point is 
determined by 

where > 0 is chosen using methods of 
par. 2-3 so that / [x^®^ - aV/^(x^®^)] is a 
minimum as a function of a. If V/ [x^^M 4= 
0 then / [x^^M < / [x^^M , so x^'^ is taken 


as a better estimate of the minimum point 
and the process is continued until V/ [x^^^] = 
0 ordx is sufficiently small. This method may 
be given in compact form as the steepest 
descent algorithm: 

Step 1. Make the best engineering estimate 
x^®^ of the minimum point. 

Step 2. Compute V/[x^^M and define a 
normalized gradient s = 
V/^[jc^^^].Find a = 

which minimizes / [x^'^ +as ] 
(where i is the number of iterations 
completed). If V/ [x^'M = 0, ter¬ 
minate the process and x^*^ is a 
relative minimum point. 


Step 3. Put = x^^^ — If I I 
and IIV / II are less than 

predetermined limits, terminate 
the process and let be the 

approximation to the minimum 
point. Otherwise return to Step 2. 

It is interesting to note that successive • 
directions of steepest descent are orthogonal 
to one-another in this algorithm—i.e., 

V/Tjc <'■"*>] =0]. Tosee this, recaU 

that 0 ^^^ is chosen so that /[x^^^ - as] is a 

minimum in a. The necessary condition of 
Theorem 2-1 then requires 

0 = -^=--Z 

3a II V/[x<'M II /“I 9xj 

}L 1 _ 

3x, II V/’'lx('>] II 

V/(x<'-"‘) ] V/^[x<'M 


which was to be shown. 


In the case where x J » ** * 

view of the design variable space. The closed 
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Figure 2-5. Descent Steps 
curves in this figure are lines of constant fix). 


A relatively general convergence theorem 
pertaining to this algorithm will now be 
stated. The proof of this theorem may be 
found in Ref. 5, page 80. 

Theorem 2-5: Let f{x) be a continuous 
function defined on R'* and let be any 
point such that the closed set 

5= jocl 

is bounded, and f{x) is twice continuously 
differentiable on S. Let the matrix of second 
derivatives of/lx). 



satisfy the condition 


\y^Hy\<My^y 

for some M, every y in /?", and every jcin*S. 
Then for the sequence generated by 

the steepest descent algorithm: 

(1) A subsequence converges to a 

point 3c in 5 for which V/{3c) = 0. 

(2) / decreases monotonicaUy to 

/(3f). 


(3) If X is the only point in S for which 
V/(3c) = 0, then x^^ converges to 3c. 

Several things which Theorem 2-5 does not 
say are worthy of note. First, the theorem 
does not guarantee that the sequence of 
points generated by the Method of 
Steepest Descent will converge. Further, un¬ 
less the assumption of (3) holds, the sequence 
need not converge to an absolute minimum; it 
may converge to a relative minimum. 

The choice of the initial estimate x^®^ can 
have a great deal to do with the limit point of 
the algorithm if it does converge. If it is not 
known beforehand that a unique relative 
minimum exists, it is general practice to start 
the iterative process at several initial esti¬ 
mates. If the sequence x^^ converges to the 
same point 3c each time, then one is led to 
believe that he has indeed found an absolute 
minimum. Logic such as this can cause sleep¬ 
less nights, however, particularly if a decision 
involving considerable resources and perhaps 
even one’s job depends on the outcome. For 
this reason, the importance of at least making 
a serious attempt to apply theorems such as 
those of par. 2-2 cannot be overemphasized. 
Theorem 2-3, for example, if properly 
applied, may prevent many anxious moments. 

In spite of the simplicity of the Method of 
Steepest Descent, it has several severe restric¬ 
tions which are discussed in Ref. 5, page 159. 
These are: 

1. Even though convergence may be 
guaranteed by Theorem 2-5, an infinite num¬ 
ber of iterations may be required for the 
minimization of even a positive definite qua¬ 
dratic form. 

2. Each iteration is calculated indepen¬ 
dently of the others so that no information is 
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stored which might be used to accelerate 
convergence. 

3. The rate of convergence depends strong¬ 
ly on properties of the cost function. If the 
ratio of the largest and smallest eigenvalues of 
the matrix of second derivatives is large, the 
steepest descent algorithm generates short 
zig-zagging moves. Convergence is, therefore, 
very slow. 

For an extensive treatment of modifica¬ 
tions of the steepest descent method, which 
prevents certain of these difficulties, see Ref. 
4, Chapter 7. Several methods, presented in 
the next three paragraphs, do not suffer so 
severely from the problems just described. 

2-5 A GENERALIZED NEWTON METHOD 

In the Steepest Descent Method of par. 2-4, 
only first-order derivatives that determine the 
tangent plane of the cost surface are used to 
represent the behavior of this surface. One 
would expect that if second derivatives of the 
cost function were available, then a quadratic 
function could be constructed as an approxi¬ 
mation to the surface. The quadratic approxi¬ 
mation should allow for much better approxi¬ 
mation of the minimum point of the cost 
function. 

The idea of this method is to first use a 
second-degree Taylor formula as an approxi¬ 
mation to fix). If fix) is convex, or just con¬ 
vex near a minimum point then the minimum 
point of the quadratic should be near the 
minimum point of f{x). The minimum point 
of the quadratic approximation is then deter¬ 
mined analytically and is taken as a good 
approximation of the minimum point of f{x). 

In order to utilize Taylor’s formula in¬ 
cluding second degree terms, the following 


matrix is required 

H(x) = V^nx) = 



n xn 


Note that it is implicitly assumed here that 
f{x) has two derivatives. By Taylor’s formula. 


+ Ax] «/[jc^0)] + 

-h ^ (2-6) 

where Ax is a change in x^®^ In case/(x) is 
locally convex - convex in a neighborhood of 
x^®l- Theorem A-3 shows that/f [x^®M is 
positive semi-definite. If, in addition, 
H [x^®^l is positive definite, then it has an 
inverse. Further, / [x^®^ +Ax] in Eq. 2-6 is 
convex in Ax so Theorem 2-3 insures the 
existence of a unique minimum point of the 
quadratic function in Eq. 2-6. By Theorem 
2 -1, this unique minimum point is determined 
by 


V/^[x<0)] +^[x^0)] Ax = 0 


or 


Ax V/^ , (2-7) 

and the new estimate of the minimum point is 
x(^) =x<0) -hAx. 


Since Eq. 2-6 is just an approximation, x^^ ^ 
will probably not be the precise minimum 
point of /(x). Realizing that evaluation of 
H(x) requires computation of n (n + 1 )/2 
second derivatives of /(x), one might be 
tempted to improve the estimate for the 
minimum point before recalculating all these 
derivatives. 

An easy way of improving the estimate of 
the minimum point is to change the length of 
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the step Ax without altering its direction. The 
scalar a 1 will be determined by methods of 
par. 2-3 so as to minimize / + oAx ]. 

This procedure may now be put down in 
the form of a computational algorithm called 
Generalized Newton Method: 

Step 1. Make an engineering estimate 
of the minimum point of fix). 

Step 2. Compute 

'*>//-‘[jc^^>l V/^[x<*>] . 

where a = is chosen which 
minimizes 

as a function of a. Here, the index 
/ is the number of iterations com¬ 
pleted. 

step 3. If II V/ [x<'>J||and || x<'>|| 

are sufficiently small, terminate 
the process and take ^ ^ > as the 
minimum point of fix). Otherwise, 
return to Step 2. 

The Generalized Newton Method presented 
in this paragraph has been called the best for 
minimizing convex cost functions when 
second derivatives are available (see Ref. 5, 
page 162). Even in the case in which the cost 
function is nonconvex, if the starting point 
is near enough to a relative minimum 
point so that the cost function is convex at 
then good convergence may still be 
expected. 

In spite of the advantages of this method, it 
still has several shcilcomings. 

1. Even if fix) is convex, an inverse of 
2-12 


Mix) may not exist unless Mix) is 
strictly positive definite. 

2. In nonconvex problems an iteration 
does not necessarily decrease /[x^'^l 
when the current iterate is not near 
the minimum point. 

3. For many engineering problems, Mix) 
will be extremely messy if not im¬ 
possible to compute efficiently. 

Even in nonconvex minimization problems 
the Generalized Newton Method may be used 
in conjunction with a Steepest Descent Meth¬ 
od to form an extremely effective tool. The 
Steepest Descent Method has the property of 
making good progress even though only a 
poor estimate of the minimum point is 
available. As a relative minimum is ap¬ 
proached, however, the rate of convergence of 
the Steepest Descent Method decreases. At 
this point, however, the cost function should 
be convex since a minimum point is nearby. 
Therefore, the Generalized Newton Method 
may be employed for rapid convergence to 
the relative minimum point. 

2-6 METHODS OF CONJUGATE DIREC¬ 
TIONS 

In par. 2-4 it is pointed out that the 
Method of Steepest Descent had rather poor 
convergence properties in many problems 
because it uses only first-order approxi¬ 
mations (involving only first-order deriva¬ 
tives). Further, the Steepest Descent Method 
is not a learning process in that it does not 
store information from past iterations. The 
first deficiency is corrected in par. 2-5 where 
a Generalized Newton Method employing 
second derivatives is presented. This method, 
while having outstanding convergence prop¬ 
erties, requires the computation of nin + l)/2 
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second-order derivatives at each iteration (x is 
in ). In most engineering design problems 
this is an extremely tedious, if not impossible, 
task. Further, the Generalized Newton Meth¬ 
od is not a learning process. 

The methods presented in this paragraph 
require the computation of only first deriva¬ 
tives. However, by making use of information 
obtained from previous derivatives, con¬ 
vergence is speeded as the minimum is ap¬ 
proached. In fact, as one of the methods 
progresses, it develops an approximation to 
the matrix of second derivatives. In this 
reject the methods here have the desirable 
features of both the Method of Steepest 
Descent and the Generalized Newton Method. 

All Methods of Conjugate Directions are 
based on the philosophy “If a method works 
well in minimizing all positive definite qua¬ 
dratic forms, then it ought to work pretty 
well on any smooth cost function.” To be 
more specific, Copjugate Gradient Methods 
are guaranteed to minimize any positive 
definite quadratic form in n iterations (the 
design parameter is in /?"). Although this 
ideal behavior will not carry over to general 
cost functions, since a convex cost function 
often looks very much like a positive definite 
quadratic form, similar behavior could be 
expected. Experience has shown that this is 
the case. 

In order to be more precise, one makes 
Definition 2-5. 

Definition 2-5: Let A be a symmetric 
positive definite n x n matrix and 5', i = 0 , 
1,..., n - 1, be nonzero vectors in /?". The 
are called coqjugate with respect to A if 

5'^yl5^ = 0,/#/. (2-8) 

Since A is positive definite, the conjugate 


vectors S* are linearly independent. To see 
that this is true, form the linear combination 

"s* a,S‘ = 0, 

/-o ' 

where the are scalars. Multiplying this sum 
on the left by S^^A yields 

"s' a.Sf^AS‘=aiS/'^AS^ = 0 

I«0 ' ' 

and since S^^AS^ ¥= 0, cy = 0. Since ] was 
arbitrary, = 0, / = 0, 1,...» n - 1, and this is 
just the definition of linear independence. 

Consider now the problem of minimizing 
the convex function 

f(x) = B'^x + (2-9) 

where x is in 5 is an n x 1 matrix and A is 
a symmetric, positive definite, n x n matrix. 
The central idea of all methods based on 
conjugate directions is contained in Theorem 
2 - 6 . 

Theorem 2-6: Let 5®, .... 5"”^ be nonzero 
vectors in /?" which are conjugate with 
respect to the positive definite matrix A. 
Choose scalars X = i = 0, .... n - 1, 
successively which minimize 

/[x<^> +X5^1 (2-10) 

where/][x) is given in Eq. 2-9, 

;c(O=x<0> + 2^ X<*>5* (2-11) 

k^o 

and is any point in R^. Then x^"^ is the 
absolute minimum point of f{x) over R" . 

The two methods that follow are simply 
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based on different ways of generating con¬ 
jugate directions. There are an unlimited 
number of ways to generate conjugate direc¬ 
tions. Several ways are discussed in Ref. 6. 

2-6.1 THE CONJUGATE GRADIENT 
METHOD 

Given any set of n linearly independent 
vectors and a positive definite nxm matrix A 
a set of conjugate directions with respect to A 
can be generated by a Gram-Schmidt ortho- 
gonalization technique. Let v°, ..., 
be linearly independent vectors and define 
= v°. Now put 


Sk*i 


yk*l 


* AS' 

'■=“ S'^AS' 


Many sets of vectors v. could be chosen to 
generate coidugate directions. A natural 
choice, however, is the set of gradient vectors 
of fix), g* = where are defined 

in Theorem 2-6. Define 

5® =-g° 

5*^1 + 2 -- 5 ' ( 2 - 12 ) 

'“O S‘^A S‘ 


Alternatively, 


=v' +a,o5®. 

For A-conjugacy, it is required that 


gk*\ =_5* + i 


, * ili 'Lii.- 

'=0 S‘^A S‘ 


(2-13) 


=0 = 5«^/l(v' +a,o5«) 


Since fix) =~^x^Ax 


and 

tt| A “ * 

S^^AS^ 

Assuming 5*,...»iS* are i4-conjugate, put 
5**» =v**> 

For i4-conjugacy it is required that 

5' = 0 = V*** 5' + , ,5'’^ A S' 

where the second equality holds by 5-con- 
jugacy, so 


«*=V/r(x(*)j =Ax(*)+5, 

or from the proof of Theorem 2-6, 

Now, 

s‘ S‘ +S‘^A 

= 0./<Jt (2-15) 

due to i4-conjugacy of the 5' and 

=0,): = 0. n-1. (2-16) 


Of, 


‘* + 1/ 


AS' ^ 
S'^AS' 


k. 


By induction, the resulting directions are 
i4-conjugate and 


From Eqs. 2-13 and 2-14 

= 0,/<Jfc. (2-17) 
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Thus, the / = 0, 1, n - 1 are linearly 
independent and the / = 0,1, n - 1 are 
^-coi^ugate. 

The Conjugate Direction Method of Theo¬ 
rem 2-6 may now be applied using the 
conjugate gradients S^. The result is called the 
Conjugate Gradient Method. In order to apply 
this method to nonquadratic problems, it is 
first necessary to eliminate explicit depen¬ 
dence of the algorithm on the form of f{x). 

By definition. 


=Axf*^ +B=A[xi + \^'>Sf] +B 


or 


+\^AS/. (2-18) 

By Eq. 2-16 

= 0=gl^Sf + \^'>Sl'^A SK 


Thus, 




S'^A S! 


Substituting for from Eq. 2-12 and using 
Eq. 2-15, this is 


X(/) = 


gl'^sl 
S!'^ A S> 


(2-19) 


g’^*^'^AS‘ _ -g') 

S*^A 5' g*^g* 

By Eq. 2-17, for i < k, the right side of the 
above equation is zero. For i - k, 

g***^i4 .y* ^ 

S^^A 5* 

Substituting this result into Eq. 2-12 yields 

( _*+ir-*+i\ 

Eq. 2-20 now gives an algorithm for determin¬ 
ing the conjugate directions, even without 
knowledge of the matrix i4. 

For a general function/(x), 

g' = 

and the following algorithm for finding the 
unconstrained minimum of fix) is called the 
Conjugate Gradient Method: 

Step 1. Make an engineering estimate x^^^ 
of the minimum point and com¬ 
pute 


5® = - V/^ 


Step 2. For i = 0,1, ..., find a = which 
minimizes/ +a5^]. 

Step 3. Compute 

;cO>i)=jc(^)+a(05/ 


From Eqs. 2-18 and 2-19 


5/+1 =_ 


g‘^g‘ 


Now, 

) 


where 


2-15 


Digitized by L^ooQle 



AMCP 706-192 


Step 4. Terminate the process if 

II ‘>J II and |||| 

are sufficiently small. Otherwise, 
return to Step 2. 

When this algorithm is applied to problems 
in which /(x) is not of the form of Eq. 2-9, 
convergence will not occur in n steps. 
Fletcher and Reeves recommend that after n 
steps the algorithm should be “restarted”, i.e., 
jif(»+i) should be treated as in the 
algorithm. In a sense, the first few iterations 
of the algorithm build up information about 
the curvature of the cost surface. After n 
iterations, this information is discarded and a 
new estimate of curvature is built up during 
the next n iterations. This method then does 
not accumulate information about curvature 
of the cost surface over the entire iterative 
process. 

2-6.2 THE METHOD OF FLETCHER AND 
POWELL 

A second method based on a different set 
of conjugate directions was suggested by 
Davidon (Ref. 8) and modified by Fletcher 
and Powell (Ref. 9). This method is reported 
to be one of the most powerful known 
methods for general functions/(jc), (Ref. 10). 
A msgor reason for the success of this method 
is its capability to accumulate information 
about the curvature of the cost surface during 
the entire iterative process, even though only 
first order derivatives of the cost function 
need to be computed. 

The directions 5^^, generated by the al¬ 
gorithm that follows, are conjugate if /(x) is 
of the form of Eq. 2-9. This proof is given in 
Refs. 7 and 9. In Ref. 6 it is shown that the 
method of Fletcher and Powell fits naturally 
into a large class of conjugate direction 
methods. The derivation is tedious and lends 

2-16 


little insight into use of the method. For a 
direct proof of convergence, etc., the reader is 
referred to Ref. 7. 

The computational algorithm is: 

Step 1. Make an engineering estimate 

of the minimum point and choose 
a symmetric positive definite 
matrix 


Step 2. For i = 0,..., compute 

5(0 = _//«) 

Step 3. Compute a which minimizes 

Step 4. Compute 

a(0 = 

xU*n=x^i)+aW 
//((♦ l ) = //(0 +^(0 + 5(0 

where 

yd) = 


ylO^ ffa)yO) 


steps. If II II and ||x<''*‘> - 

xri'>|| are sufficiently small, termi- 
nate the process. Otherwise return 
to Step 2. 

Fletcher and Powell (Ref. 9) prove that this 
algorithm has the following properties: 
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1. The matrix is positive definite for 
all I. This implies the method will 
always converge to a stationary point 
since 

= - < 0 

provided ] # 0. This means that 

f\x^^ 1 may be decreased by choosing a 
> 0if/lJc<'>l ¥=0. 

2. When this method is applied to the 
positive definite quadratic from Eq. 2-9, 

converges to A~^. 

This method might be called a learning 
process in that only first derivatives are ever 
computed, but as the algorithm progresses an 
approximation of the matrix of second deriva¬ 
tives is generated. Many experienced re¬ 
searchers in the area of optimization methods 
laud this method as one of the best available. 

2-6.3 A CONJUGATE DIRECTION METH¬ 
OD WITHOUT DERIVATIVES 

Occasionally in applications, one is faced 
with a problem in which computation of 
derivatives of the cost function is impossible 
or at least prohibitive from a computational 
point of view. There are many techniques for 
solving this sort of problem given in Ref. 4. 
An efficient technique, not presented in 
common texts, was developed by Powell (Ref. 
11) using conjugate directions. 

A computational algorithm is presented 
here without proof. For a proof that the 
algorithm generates conjugate directions the 
reader is referred to Ref. 11. The computa¬ 
tional algorithm is: 


Step 1. Make an engineering estimate of 
the minimum point of f(x). 
Choose vectors 5^, / = 1, ..., n, in 
the coordinate directions of R". 

Step 2. Find a = a*, k = 1, ..., n, which 
minimize/[jc^*‘^^ +05*] 

where 

yk -ykA ^ofk^k ^ 

and / is the number of iterations 
which have been completed. Note 
that in the one dimensional mini¬ 
mization for a*, it is possible for 
o* < 0. 

Step 3. Find the integer m, 1 < m < n for 
which 

/O''" 1 )-/(>''") 
is the largest and define 
A=/0''"-‘)-/0''"). 

Step 4. Define /, = f{y° ), /j = /O'"), and 
/3=/(2y" -y"). 

Step 5. If/j > fi or 

(/» -2/2 +/j)x(/, - A)* 

>-y(/i -/3)^ 

put 

O = y^ , 

Terminate the process if 
- is sufficiently small. Other- 
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wise return to Step 2 with the 
same set of y. / = 1,..., n. 

Step 6. If neither of the inequalities of 
Step 5 hold, define s = -y ^)and 

find a = a which minimizes 


f 2 (Xi.X 2 ,X 3 .X 4 ) = iXi + lOXj)* 

+ 5(JC3 -JC,)* 

( 2 - 22 ) 

+ (X2-2X3)* 


f(y* -ha5). 
Put 


+ 10(;ci 


and 


^(*♦1) = yf* 

Terminate the process if 
- II is sufficiently small. Other¬ 
wise return to Step 2 with the new 
set of vectors ..., 5^**^*, 

...» 5 '*, s. 

For a discussion of use of the 
algorithm, see Ref. 11. 


2-7 COMPARISON OF THE VARIOUS 
METHODS 

During the development of the methods 
presented in this chapter, theoretical advam 
tages and disadvantages have been pointed 
out. As a concrete test of these methods, 
three functions will be minimized. Two of the 
functions to be treated are terribly behaved 
and pose a meaningful test to any general 
minimization technique. These functions re¬ 
semble a very deep valley at whose bottom 
the curvature in two orthogonal directions is 
radically different. The third function is 
quadratic and poses no serious obstacle to any 
reasonable method. More specifically, these 
functions are 


fl(Xi,Xi) = 10(KX2 - jcJ)* 


( 2 - 21 ) 


/j(x,,X2,a:3)=xJ + 2 xj +2X3 

(2-23) 

+ 2X1^2 + 2X2X^ 

The reader should verify that each of these 
functions has a strict absolute minimum 
point. These points are (1,1), (0,0,0,0), and 
(0,0,0), respectively. Each iterative method 
will be started at points (- 1,1), (1,1,1,1), 
and (1,1,1) for Eqs. 2-21, 2-22, and 2-23, 
respectively. These functions will all be mini¬ 
mized by each of the methods of pars. 2-4 
through 2-6. The stopping criterion will be 
that each component of the independent 
variable must be within 10“^ of the known 
minimum point. 


Results will be presented in tabular form so 
that a comparison of the behavior of each of 
the methods may be made. For the sake of 
uniformity, each table will include the itera¬ 
tion number /, the iterate .... 

x„^^^ 1 and the value of the cost function. 

2-7.1 METHOD OF STEEPEST DESCENT 


2-7.1.1 COST FUNCTION: Mx) = IOCKjc^ 

-x]f +(1 -X,)*. 


Exact solution: (1,1),/|(1,1) = 0 
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TABLE 2-1 


STEEPEST DESCENT METHOD - 
ITERATIVE DATA FOR COST 
FUNCTION fi U) 


/ 


^ (/) 

^ (/) 

0 

404.0 

- 1.0 

- 1.0 

1 

19.97 

0.2576 

- 0.3743 

2 

0.8654 

0.0707 

0.00067 

3 

0.318 

0.452 

0.1910 

4 

0.3048 

0.448 

0.199 

5 

0.2929 

0.472 

0.211 

6 

0.2828 

0.4685 

0.218 

29 

0.1752 

0.5861 

0.3373 

30 

0.1728 

0.5846 

0.3403 

73 

0.1081 

0.6739 

0.4499 

74 

0.1071 

0.6729 

0.4517 


2-7.1.2 COST FUNCTION: /jW = (x, 

+ 10X2 + 5 (X 3 - X4)^ + (Xj - 2X3)^ + 

10(Xi -X4)^ 

Exact solution: (0,0,0,0), /2(0) = 0 


TABLE 2-2 


STEEPEST DESCENT METHOD - ITERATIVE 
DATA FOR COST FUNCTION fi U) 


/ 


x ,« 

x ,« 

x,W 

K ,«> 

0 

122.0 

1.0 

1.0 

1.0 

1.0 

1 

16.43 

0.9055 

0.055 

1.0 

1.0 

2 

16.31 

02019 

0.023 

0.9958 

02581 

5 

16.03 

0.8925 

- 0.0498 

0.969 

0.746 

6 

15.06 

0286 

- 0.0756 

0.923 

0.463 

9 

1225 

0.641 

- 0.063 

0.699 

- 0.156 

10 

3.00 

- 1.048 

0.0746 

- 0.1608 

-02197 

11 

2.006 

- 1.039 

0.1522 

- 0.258 

- 0.815 

12 

1.380 

- 1.043 

0.078 

- 0.298 

- 0.752 

13 

1.188 

- 1.033 

0.1127 

- 0.3276 

- 0.7067 

14 

1D47 

- 1.021 

0.090 

- 0.362 

- 0.634 

15 

1.041 

- 1.015 

0.0949 

- 0.368 

- 0.619 

16 

1.040 

- 1.012 

0.0960 

- 0.370 

- 0.611 

38 

1.039 

- 1.008 

0.0967 

- 0.373 

- 0.603 

74 

1.039 

- 1.008 

0.0968 

- 0.373 

- 0.6019 


2-7.1.3 COST FUNCTION: /3(x) = xJ +2x^ 

+ 2x3 + 2x1X2 + 2x2X3 


Exact solution: (0,0,0), /3(0) = 0. 

TABLE 2-3 

STEEPEST DESCENT METHOD - ITERATIVE 
DATA FOR COST FUNCTION U) 


/ 



V 

„ f/J 

0 

9.0 

1.0 

1.0 

1.0 

1 

0.0714 

02857 

- 0.0715 

- 0.0715 

2 

0.01311 

0.1632 

- 0.153 

0.0512 

3 

0.0088 

0.1604 

- 0.114 

0.065 

4 

0.00679 

0.1245 

- 0.062 

0.053 

5 

0.00243 

0.078 

- 0.0625 

0.0204 

6 

0.0018 

0.073 

- 0.0476 

0.02727 

7 

0.00063 

0.0218 

- 0.00305 

0.0133 

8 

0.00006 

0.014 

- 0.00956 

0.0035 

9 

0.00005 

0.011 

- 0.00686 

0.00485 

10 

0.00003 

0.0041 

- 0.0036 

0.0040 


It should be noted that the Steepest De¬ 
scent Method decreased the cost function 
rapidly on the first iteration but in the first 
two problems failed to converge to the 
minimum point. That is typical behavior for 
this method, particularly in problems for 
which the cost function has a long sharp 
valley. It should be clear that blind use of the 
Method of Steepest Descent can yield poor 
results. 

2-7.2 GENERALIZED NEWTON METHOD 
2-7.2.1 COST FUNCTION: /,(x) = 10(Xx2 

-x\)^ +(1 -X,)* 

Exact solution: (1,1),/i(l,l) = 0 
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TABLE 2-4 ♦Note: The trial starting point (1,1,1,1) was 

a singular point for VV 2 so an alternate 
GENERALIZED NEWTON METHOD - ITERATIVE Starting point was chosen and the algorithm 
DATA FOR COST FUNCTION converged. 

fiix) 


/ 


X 

„ (!) 

0 

404.0 

-1.0 

-1.0 

1 

3.981 

-0.9950 

00869 

2 

3.403 

-0.7919 

00832 

3 

2.588 

-0.5248 

02241 

4 

1.548 

-0.1832 

-0.5105 

5 

0.953 

0.0887 

-0.0271 

6 

0.473 

0.3642 

0.1063 

7 

0203 

0.5955 

0.3347 

8 

0.0531 

0.8020 

0.6315 

9 

0.0042 

0.9536 

00049 

10 

0.0002 

0.9900 

00810 

11 

2x 10’^ 

1.0003 

1.0007 


2-7.2.2 COST FUNCTION: ^(jc) = (x^ + 
10^2)2 -I- 5(^3 - JC4)^ +(JC 2 - 2 X 3 )^ + 10(x, 

-^ 4 )^ 

Exact solution: (0,0,0,0), ^(O) = 0 


TABLE 2-5 

GENERALIZED NEWTON METHOD - ITERATIVE 
DATA FOR COST FUNCTION 


_■ 


w (0 

^1 

„ (i) 

X2 

X3 

^4 

0 

137.0 

1.0 

1.0 

1.0 

2.0* 

1 

2.137 

-0.3368 

0.0175 

0.3396 

0.3249 

2 

0.0496 

-0.0640 

0.0250 

0.1060 

0.1229 

3 

0.0025 

-0.0591 

0.0047 

0.0627 

0.0617 

4 

0.0007 

-0.0236 

0.0031 

0.0263 

0.0271 

5 

0.00001 

-0.0148 

0.0014 

0.0161 

0.0160 

6 

1x10'^ 

-0.0070 

0.0007 

0.0078 

0.0079 


2-7.2.3 COST FUNCTION: Mx)^x]^2xl 

2^3 +2x1^2 +2X2X3 
Exact solution: (0,0,0), ^(0) = 0. 

TABLE 2-6 

GENERALIZED NEWTON METHOD - 
ITERATIVE DATA FOR COST 
FUNCTION f^ix) 

i X2<'» X3<'> 

0 9.000 1.0 1.0 1.0 

1 2x10"* 0.0015 0.0015 0.0015 


These results indicate that the Generalized 
Newton Method is indeed very powerful. 
Even in the second cost function where the 
initial estimate caused a singularity in v^/ 2 , a 
second starting point yielded good results. 
Similar behavior has been noted in the litera¬ 
ture, so one can expect to get good results 
with this method. It must be remembered, 
however, that this method requires that sec¬ 
ond derivatives of the cost function be com¬ 
puted. 

2-7.3 CONJUGATE GRADIENT METHOD 

2-7.3.1 COST FUNCTION: /,(x) = \00(x2 

-xj)*+(i -x^y 

Exact solution: (1,1),/,(1,1) = 0. 
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TABLE 2-7 


CONJUGATE GRADIENT METHOD-ITERATIVE 
DATA FOR COST FUNCTION 

fiix) 


/ 


^ m 

X2 

0 

404.0 

-1.0 

-1.0 


9649. 

0.1143 

0.0102 

0.0839 

1 

9649. 

0.3258 

0.0102 


22.19 

0.5106 

02360 

2 

22.19 

0.5005 

02482 


0.5033 

0.6307 

0.3820 

3 

0.5033 

0.6244 

0.3882 


02226 

0.7267 

0.5178 

4 

02226 

0.7227 

0.5212 


0.001637 

0.9919 

02827 

8 

0.001637 

0.9842 

0.9868 


0.000067 

0.000013 

0.999754 

11 

0.000067 

0.999884 

0.999768 


2-7.3.2 COST FUNCTION: /^U) = (x^ + 
10 X 2 ^ + 5 (X 3 - JC4)" +(^2 - 2x3)^ + 10(X| 
-X^)^ 

Exact solution: (0,0,0,0), /2 (0) = 0. 

TABLE 2-8 

CONJUGATE GRADIENT METHOD - ITERATIVE 
DATA FOR COST FUNCTION 

fiix) 






^ 6 ) 

^ 6 ) 

0 

122 0 

1 0 

1 0 

10 

10 


2925 27 

0 9016 

0 0346 

0 9642 

1 000 


2925 27 

0 8632 

007787 

04158 

0 9960 


36 55 

0 8561 

00101 

03642 

0 6573 

1 

192 2 

0 8404 

0 0563 

0 3324 

04438 


192 2 

0 8180 

00860 

03185 

0 4540 


12 11 

0 7507 

0 0685 

0 2651 

04787 


3 023 

0 6410 

00650 

0 2533 

0 4104 

2 

26 29 

03404 

0 0281 

0 2192 

0 2025 


26 20 

0 3378 

0 0370 

0 2079 

0 2057 


1 651 

0 3331 

0 0319 

0 1748 

0 2130 


0 1136 

0 3159 

0 0371 

0 1473 

0 1698 

6 

0 0531 

0 0305 

0 0029 

00718 

00717 


0000751 

0 0293 

0 0031 

0 0696 

00714 


0 000751 

0 0294 

0 0028 

0 0695 

00713 


0 000556 

0 0297 

0 0029 

0 068 / 

0 0677 

8 

0 001091 

0 03519 

0 003519 

0 02338 

0 02503 


0 000512 

0 035322 

0 03498 

0 023112 

0 023119 


0 000512 

0 035318 

0 003530 

0 023108 

0 023119 


1 * 10 -’ 

0 035305 

0 003528 

0 023011 

0 023115 


2-7.3^ COST FUNCTION: h(x)^x\ ^Ix] 

+ 2X3 +2x1X2 +2x2X3 

Exact solution: (0,0,0), /3(0) = 0. 


TABLE 2-9 

CONJUGATE GRADIENT METHOD - 
ITERATIVE DATA FOR COST 
FUNCTION 


/ 


(/) 

w (/) 

„ (/> 
*3 

0 

9.0 

1.0 

1.0 

1.0 


0.1181 

0.3829 

-0.2340 

0.0744 


0.0293 

0.2571 

-0.2285 

0.1428 

1 

0 

0 

0 

0 


The numerical results presented here indi¬ 
cate that the Coi\jugate Gradient Method is 
very effective even for the Rosenbrock func¬ 
tion /i(x). The method requires approxi¬ 
mately the same amount of computation per 
step as the Steepest Descent Method but 
shows spectacularly improved performance. 


It should be noted, however, that con¬ 
vergence slows as the minimum point is 
approached. In fact, as shown in Table 2-8, 
convergence to the required accuracy was not 
attained in one case. 

2-7.4 FLETCHER-POWELL METHOD 

2 - 7 . 4 .I COST FUNCTION; /,(x) = 100 (jc* 

-xj)* +(1 -a:,)* 

Exact solution': (1,1),/i(l,l) = 0. 
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TABLE 2-10 


FLETCHER-POWELL METHOD-ITERATIVE 
DATA FOR COST FUNCTION U) 


/ 

fi u"^i 

^ (/) 

^ m 

X 2 

0 

404.0 

- 1.0 

- 1.0 

1 

19.97 

0.2570 

-0.3746 

2 

0.7839 

0.1146 

0.01249 

3 

0.7570 

0.1422 

0.005683 

4 

0.7424 

0.1727 

0.005740 

5 

0.5377 

0.3378 

0.08262 

6 

0.4013 

0.3689 

0.1416 

7 

02968 

0.4815 

02151 

8 

0.2524 

0.5616 

02909 

9 

0.03621 

0.8286 

0.6784 

10 

0.03216 

0.8207 

0.6733 

11 

0.02568 

0.8536 

0.7221 

12 

0.01162 

0.9268 

00511 

13 

0.00437 

0.9342 

00733 

14 

0.00106 

0.9760 

0.9504 

15 

8 x lO'^ 

0.9982 

00967 


2-7.4.2 COST FUNCTION: /,(jc) = (x, - 
IOjc,)* + 5(^3 -JC 4 )* +ix2 - 2xj)^ + 10(X, 

-X4/ 

Exact solution: (O.O.O.O), A (0) = 0. 


TABLE 2-11 

FLETCHER-POWER METHOD-ITERATIVE DATA 
FOR COST FUNCTION fj(x) 


/ 




X,« 

x.*'! 

0 

122.0 

1.0 

1.0 

1.0 

1.0 

1 

14.4292 

0.9017 

0.03472 

0.9642 

1.0 

2 

2.3775 

0.8630 

-0.07820 

0.4120 

0.9960 

3 

0.6678 

0.8430 

-0.08740 

0.3618 

0.4^6 

4 

0.3353 

0.2087 

-0.02560 

0.3644 

0.3305 

5 

0.05134 

0.1117 

0.006686 

0.1883 

0.1952 

6 

0.01059 

0.07931 

-0.009696 

0.1532 

0.1526 

7 

0iX)067 

0.02731 

-0.0007003 

0.06189 

0.06276 

8 

0.00016 

0.02164 

-0 002344 

0.05417 

0.05409 

9 

8.3x lO** 

0.00267 

-0.0000359 

0.0191 

0.0192 

10 

2.1 X lO'^ 

0.00148 

-0.000163 

0.0172 

0.0172 

11 

10-^ 

-0.0057 

0.00060 

0.00341 

0.00342 


2-7.4.3 COST FUNCTION: ACx) -I- 2Jr 

+ 2 X 3 + 2x,X2 + 2 X 2 X 3 

Exact solution: (0,0,0), ^(O) = 0. 


TABLE 2-12 


FLETCHER-POWELL METHOD - 
ITERATIVE DATA FOR COST 
FUNCTION 73 (x) 


/ 


^ (/) 

- (/) 

^ (/) 

0 

9.0 

1.0 

1.0 

1.0 

1 

0.05319 

0.3830 

-0.2340 

0.07447 

2 

0.02857 

0.2571 

-0.2286 

0.1429 

3 

3x i(r‘^ 

2 x i(r^ 

- 2 x i(r^ 

-3 X 10"^ 


The Fletcher-Powell Method requires slight¬ 
ly more computation than the Conjugate 
Gradient Method. However, its convergence 
properties are very good as the minimum 
point is approached, in contrast to the be¬ 
havior of the Coqjugate Gradient Method. 

This method appears to have good prop¬ 
erties in all ranges of the iterative process. It is 
more stable than the Generalized Newton 
Method in the early stages of computation 
and converges more rapidly than the Gradient 
and Conjugate Gradient Methods near the 
minimum point. In these respects it has the 
desirable properties of other methods without 
having their undesirable properties. 


2-7.5 CONJUGATE DIRECTIONS WITH¬ 
OUT DERIVATIVES 

2-7.5.1 COST FUNCTION: /,(x) = 100(x2 

-xj)' +(1 -X,)* 

Exact solution: (1,1),/,(1,1) = 0. 
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TABLE 2-13 


CONJUGATE DIRECTIONS WITHOUT 
DERIVATIVES METHOD-ITERATIVE DATA 
FOR COST FUNCTION fi U) 


/ 

ft lx"^l 

* 

^ (1) 

0 

404.0 

-1.0 

-1.0 


100.1 

0.0049 

-1.0000 


0.9902 

0.0049 

0.0000 

1 

0.9902 

0.0261 

0.0211 


0.9485 

0.0261 

0.0007 


0.9402 

0.0429 

0.0174 

2 

0.7922 

0.1287 

-0.0016 


0.7022 

0.1815 

0.0509 


0.6172 

0.2147 

0.0436 

3 

0.3958 

0.4058 

0.1440 

4 

0.2895 

0.4785 

0.2422 

5 

0.2591 

0.5308 

0.3015 

6 

0.0770 

0.7258 

0.5225 

7 

0.0282 

0.8564 

0.7246 

8 

0.0125 

0.8942 

0B033 

9 

0.0119 

0.9116 

0B373 

10 

0.0116 

0.9039 

0B218 

11 

0.0125 

0.9469 

0.9065 

12 

0.0042 

1.0363 

1.0792 

13 

0.0002 

0.9886 

05781 

14 

0.0002 

1.0032 

1.0079 


2-7.5.2 COST FUNCTION: ^(jc) = (jc, 
-10x2) + 5(^3 - X4) + ( X 2 - 2x3) - 

X 4 ) 

Exact solution: (0,0,0,0), /2(0) = 0. 

TABLE 2-14 

CONJUGATE DIRECTIONS WITHOUT DERIVATIVES 
METHOD-ITERATIVE DATA FOR COST 
FUNCTION faU) 


/ 


^ 0) 


^ fiJ 

X4 

0 

122.0 

1.0 

1.0 

1.0 

1.0 


109.1 

05051 

1.0000 

1.0000 

1.0000 


18.45 

05051 

0.1140 

1.0000 

1.0000 


7.667 

05051 

0.1140 

0.4819 

1.0000 

1 

2.371 

05051 

0.1140 

0.4819 

0.4284 


2.157 

0.0469 

0.1140 

0.4819 

0.4284 


1.075 

05469 

0.0127 

0.4819 

0.4284 


TABLE 2-14 (Continued) 


/ 

ft \x*"\ 


„ 0) 

•*2 

w (0 

^ (t) 


0.4421 

0.0469 

0.0127 

05799 

0.4284 

2 

0.1415 

0.0469 

0.0127 

05799 

05423 


0.1418 

0.0510 

0.0127 

0.2799 

05423 


0.1210 

0.0510 

-0.0015 

05799 

0.2423 


0.0498 

0.0510 

-0.0015 

0.1875 

0.2423 


0.0246 

0.0510 

-0.0015 

0.1875 

0.1749 

3 

0.0082 

0.0536 

-0.0104 

0.1291 

0.1324 

4 

0.0020 

0.638 

-0.0181 

0.0794 

0.0882 

5 

0.0018 

0.1322 

-0.0147 

0.0892 

0.0940 

6 

0.0010 

0.0828 

-0.0109 

0.0580 

0.0603 

7 

0.0005 

0.0412 

-0.0057 

0.0377 

0.0322 

8 

0.0000 

0.0078 

-0.007 

0.0058 

0.0050 


2-7.5.3 COST FUNCTION: + 2a:^ 

+ 2X3 + 2X1X2 + 2X2X3 


Exact solution: (0,0,0),/3(0) = 0. 


TABLE 2-15 

CONJUGATE DIRECTIONS WITHOUT DERIVATIVES 
METHOD-ITERATIVE DATA FOR 
COST FUNCTION ^U) 


/ 

filx"*] 

x/" 


x,W 

0 

9.0 

1.0 

1.0 

1.0 


5.000 

-1.0000 

1.0000 

1.0000 


3.000 

-1.0000 

0.0000 

1.0000 

1 

1.000 

-1.0000 

0.0000 

0.0000 


0.000 

0.0000 

0.0000 

0.0000 


The Conjugate Directions Without Deriva¬ 
tives Method is not as efficient as some of the 
methods that require computation of deriva¬ 
tives. However, there are many problems in 
which computation of derivatives is either 
impossible or very difficult. In these prob¬ 
lems, this method appears to be effective. 
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2-8 AN APPLICATION OF UNCON¬ 
STRAINED OPTIMIZATION TO 
STRUCTURAL ANALYSIS 

As pointed out earlier in this chapter, 
optimal design problems are seldom uncon¬ 
strained. There is, however, a large class of 
analysis problems which can be solved using 
unconstrained optimization methods. In Ap¬ 
pendix B, energy principles which govern 
equilibrium, vibration, and stability of struc¬ 
tures are given. The condition for equilibrium 
is particularly direct since it requires that, in 
problems for which the strain energy is 
quadratic, the equilibrium state, 3c, minimizes 
V of Eq. B-18, Appendix B, 

V = jx^Kx-x'‘'F. (2-24) 

Even in some problems which are nonlinear 
and the total potential energy is not qua¬ 
dratic, the minimum energy principle applies. 

In view of the quadratic form of Eq. 2-24, 
conjugate direction methods are indicated. 
Even for nonquadratic energy expressions, 
methods for conjugate directions appear to be 
very efficient. For a much more detailed 
treatment of this class of equilibrium prob¬ 
lems, see Ref. 12. 

A second structural analysis problem for 
which unconstrained optimization methods 
hold even more promise is the eigenvalue 
problem. As shown in Appendix B, vibration 
and buckling problems reduce to eigenvalue 
problems of the kind 

Ky^XAfy. (2-25) 

In this problem, the smallest eigenvalue X j of 
the Eq. 2-25 is sought. One method of solving 
this problem is to rewrite Eq. 2-25 as 

K-^My= yj/. (2-26) 

2-24 


In this form, an iterative technique such as 
the power (or iteration) method (Ref. 13, 
page 93) may be applied to obtain the largest 
eigenvalue of the matrix K~’^M and hence, 
the smallest eigenvalue of the original prob¬ 
lem. Even though the power method is 
efficient, this approach has the severe dis¬ 
advantage of requiring that * be com¬ 
puted. 


A more promising approach to the above 
eigenvalue problem utilizes the Rayleigh 
quotient (Ref. 13, page 83), i.e., the smallest 
eigenvalue Xj of Eq. 2-25 is given by 


min y^ Ky 
y^O y^My 


(2-27) 


If the vector y is normalized by fixing one of 
its elements, the resulting vector denoted y, 
then Eq. 2-27 reduces to 


min y ^ Ky 

y WMy * 


(2-28) 


The minimization Eq. 2-28 may now be 
solved by any of the methods of the present 
chapter. The method of conjugate directions 
has been recently applied to solve this class of 
problems (Refs. 14, 15). It is interesting to 
note that this exact approach to the eigen¬ 
value problem was proposed by the inventor 
of conjugate direction methods, M. R. 
Hestenes, in 1955 (Ref. 16, page 93). The 
technique was apparently not used in engi¬ 
neering problems, however, until 1966. 


Iterative methods of the kind outlined in 
this paragraph are particularly appropriate for 
iterative optimal design techniques. As dis¬ 
cussed in Chapter 5, the most time consuming 
task in iterative design methods is the re¬ 
analysis of the system during each iteration; 
i.e., after the design variable is changed 
slightly, analysis for stresses, displacements, 
and eigenvalues must be done even though it 
is expected that these quantities will be very 
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close to their values before the change in 
design variables. By using an iterative tech¬ 
nique such as conjugate directions, the pre¬ 
vious state may be used as an estimate to start 


the minimization algorithm. In this way, rapid 
convergence to the new state of the system is 
attained. This approach has been applied with 
good success (Ref. 15). 
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CHAPTER 3 

LINEAR PROGRAMMING 


3-1 INTRODUCTION 

In the preceding chapter a function f{x), x 
in was minimized with no restrictions 
placed on the location of the design variable 
X. Problems in the real world seldom reduce 
to this form. In virtually all engineering design 
problems, requirements are placed on the 
object being designed, and these requirements 
are stated in terms of equations involving the 
design variable. More often, these require¬ 
ments may be stated in terms of inequalities 
involving the design variable. 

Examples of inequality constraints are 
abundant in all areas of engineering design. 
The following are examples: 

1. Optimal structural design 

a. Stress must be less than or equal to the 
yield strength of the material. 

b. Buckling load must be greater than 
or equal to applied loads. 

c. Deflection of the structure must nqt 
exceed specified limits. 

d. Natural frequency must lie within an 
allowable range. 

2. Optimal circuit design: 

a. Voltage must remain within linear 
range of components. 


b. Power consumption must be below a 
specified level. 

c. Capacitance of a proposed capacitor 
must be within attainable limits. 

3. Aerospace vehicle guidance: 

a. Controller thrust must be within the 
capability of the thruster. 

b. Total fuel consumption for a mission 
must be less than or equal to the vehicle’s 
storage capacity. 

c. Altitude must be greater than or 
equal to zero. 

This list of typical inequality constraints 
could be expanded many-fold. It is clear then 
that the inequality constraint must play a 
central role in any unified theory of design. 

The class of problem considered in this 
chapter is very restricted. Only linear func¬ 
tions are to be minimized subject to con¬ 
straints which are linear in the design vari¬ 
ables. In matrix notation this is, minimize 

nx) = C^x (3-1) 

where C is an n x 1 matrix of constants. The 
design variable x is required to satisfy 

Ax < B \ 

> (3-2) 

jc> 0 ' 

3-1 
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where A is an m x n matrix and ^ is an m x 1 
matrix. The inequality, Eq. 3-2, is taken as 

2 flyy. . < h / ” 1 ,, n . 

i.e., when one vector is less than or equal to 
another vector, each of the components of 
the vectors must satisfy this relation. 

Example 3-1: Consider the problem of 
minimizing 

f(x) = X, + (3-3) 

subject to the constraints 

2Xj +X 2 <4 \ 

x,>0 > (3-4) 

Xj > 0. / 

The constraints, Eq. 3-4, are satisfied at all 
points in the triangular region of Fig. 3-1. The 
lines passing through this region are lines of 
constant value of /(x). It is clear that as the 
line is translated downward, the value of/{x) 
decreases and that the lowest line that still 
contains points in the admissible region oc¬ 
curs for Xj + 2x2 ” Since this line 
intersects the admissible region only at (0,0), 
/(x) takes on an absolute minimum at (0,0). 



Figure 3-1, Graphical Solution of 
Example 3-1 


As will be seen in the following paragraph, 
this is typical of linear programming prob¬ 
lems. 

Before proceeding to the next paragraph, it 
is worthwhile to discuss the applicability of 
linear programming. The theory of linear 
programming arose out of studies of econom¬ 
ic activities. In economics it is often the case 
that behavior of an economic system is 
predictable only in a rather crude way, so 
frequently a linear relation among variables is 
as good a representation as can be expected. 

In engineering design, however, it is very 
seldom that the behavior of an object or 
process can be described by linear expres¬ 
sions. One might be tempted, then, to com¬ 
pletely ignore linear programming. Even 
though it is not directly applicable to most 
engineering design problems, however, linear 
programming is still a very powerful tool. 
First, even though the computational pro¬ 
cedures of linear programming do not carry 
over to the real nonlinear world, many facets 
of the behavior of solutions are very similar in 
more general programming problems. The 
engineer who has mastered linear pro¬ 
gramming will go into the study of the much 
more complex nonlinear programming armed 
with a powerful tool - intuition. Further, the 
solution of many nonlinear problems can be 
reduced to the solution of a sequence of 
linear programming problems. For a review of 
some of these applications of linear pro¬ 
gramming methods see Ref. 1. 

a-2 PROPERTIES OF LINEAR PRO¬ 
GRAMS 

To formalize the discussion of the previous 
paragraph, the following definition is made. 

Definition 3-1: The linear programming 


3-2 
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problem is the problem of determining that x 
in which minimizes 


B'^x 

and which satisfies 
Ax> C 

X > 0 i 


LP 


(3-5) 


(3-6) 

(3-7) 


where C ^ 0 is an m x 1 matrix, A is an m x n 
matrix, S is an n x 1 matrix and the 
symbolism < (>) as applied to matrices means 
that the relation less than or equal to (greater 
than or equal to) holds for corresponding 
components of the matrices. 


describe an admissible object or process, i.e., 
one which performs the required service but is 
not necessarily optimal. In LP the constraint 
set is a polyhedron and, according to Def. 2-4, 
this constraint set is convex. Further, accord¬ 
ing to the same definition, the cost function 
f{x) for LP is convex. If the constraint set is 
bounded and nonempty, it is necessarily also 
closed and all the hypotheses of Theorems 2-2 
and 2-3 are satisfied. One then concludes that 
fix) has a strict absolute minimum in the 
constraint and that is has no other relative 
minima. 

Further, if fix) had a minimum in the 
interior of the constraint set, the necessary 
condition of Theorem 2-1 implies 


It should be pointed out that Eqs. 3-5 
through 3-7 do not explicitly cover all linear 
optimization problems. For example, it may 
be required to maximize a linear objective 
function. Further, equality constraints may 
be imposed and negative values of the x^ may 
be allowed. However, all these variations on 
the linear programming problem may be put 
into the form of the problem previously 
considered. An objective function may be 
maximized by minimizing its negative, equal¬ 
ity constraints are nothing more than a pair of 
inequality constraints (i.e., y = 0 if and only if 
y < 0 and - y < 0), and a negative x^ may 
always be written as the difference between 
two new non-negative variables. There is 
therefore, no loss of generality in considering 
only the problem expressed by Eqs. 3-5 
through 3-7. 


— =c^ = 0,/= l,...,n 

dx, 

which contradicts Def. 3-1 of LP. Therefore, 
fix) cannot have a minimum point in the 
interior of the constraint set but must take on 
its minimum at the boundary. Weyl has 
shown, in fact, that the solution must lie on 
one of the vertices of the polyhedral con¬ 
straint set (Ref. 2). 

In spite of this elementary theory, it is 
possible that a linear programming problem 
may not have a solution. This may happen for 
two reasons. First, the constraint set may be 
empty; and second, the constraint set may be 
unbounded and the cost function may be 
decreased without restriction. In order to 
facilitate discussion of these difficulties. 
Definition 3-3 is made. 


Definition 3-2: The constraint set for the 
linear programming problem of Def. 3-1 is the 
set of points in R” which satisfy Eqs. 3-6 and 
3-7. 

The constraint set associated with a prob¬ 
lem is just the set of design variables which 


Definition 3-3: If the constraint set of LP 
is nonempty (empty), the problem is called 
feasible (infeasible). If the constraint set is 
unbounded and the cost function is not 
bounded below, then the problem is called 
unbounded. 


3-3 
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The concept of the dual problem that will 
be used in constructing solutions of LP’s will 
now be discussed. The dual problem will also 
play a major role in obtaining results for more 
general optimization problems. 

Definition 3-4: The linear programming 
problem of maximizing 

C^y (3-8) 

for y in satisfying 

( LPD 

(3-9) 

where the matrices A, B, and C are the same 
as in LP, and are called the dual of LP. 

The results of Theorem 3-1 relating LP and 
LPD are proved in Ref. 3, page 41, and Ref. 
4, page 118. 

Theorem 3-1: Let jc and y be in the 
constraint sets of LP and LPD, respectively. 
Then 

C'^'y (3-10) 

2. If C^y = B^x thenar andy (3-11) 
are the solutions of LP and LPD, respectively. 

3. If LP (LPD) is unbounded, then LPD 
(LP) is infeasible. 

4. If LP (LPD) is feasible and LPD (LP) is 
infeasible, then LP (LPD) in unbounded. 

These results are useful in constructing 
solutions of linear programming problems. 
They are also used in providing Theorem 3-2 
that is central to linear programming theory. 

3-4 


Theorem 3-2: Let LP and LPD both be 
feasible. Then both have solutions 3c and y, 
respectively, and B^x = C^y. 

The proof of Theorem 3-2 is involved and 
does not yield a method of constructing 
solutions. It may be found in Ref. 3, page 44, 
or Ref. 4, page 118. 

Since the solution of LP must lie on a 
vertex of the polyhedral constraint set, it 
suffices to check at most a finite number of 
points for the minimum. This procedure is 
followed in an organized way by beginning at 
any vertex of the constraint set. If the cost 
function cannot be decreased by moving 
along an edge of the polyhedron that inter¬ 
sects this vertex, then this vertex is the 
solution. If, however, the cost function de¬ 
creases by moving along some edge, this 
policy is followed until a second vertex is 
reached and the cost function has been 
reduced. Since there are only a finite number 
of vertices and it is impossible to return to a 
previously occupied vertex, the process must 
terminate at the minimum over the constraint 
set. 

In order to illustrate the argument pre¬ 
sented in the preceding paragraph, consider 
Example 3-2. 

Example 3-2: By moving along edges of 
the constraint set, solve the LP 

minimize/(jc,, JC 2 ) = - 2 jci - JC 2 
subject to 

- Xi > — 1 

- X 2 > - 1 

- 2xi - 2^2 > - 3 
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JC1.X2 > 0 . 

Solution: The polyhedral constraint set is 
shown in Fig. 3-2. 



The vector 


The only move admissible is toward (1/2, 
1). A unit move in this direction is obtained 
from 


r 



2 


which causes a change in/, 

df= Vni,H2)dx = + y/7- ■^= 0. 

Therefore, / may not be decreased in moving 
from the vertex (1, 1/2) so this point is the 
solution of the problem. 


-aVfT(xt ,x,) = « 

whose direction as shown in Fig. 3-2 is the 
direction of steepest descent of f{x). Starting 
at (0,0) a unit movement along the Xi -axis 
yields a change 

d/= V/(0,0)dx = - 2 

and a unit movement along the x 2 -axis yields 
a change 

d/= V/(0,0)dJc = - 1 

so both moves yield a decrease in f{x). Choose 
the JCj-axis and move to the first vertex (1,0). 
The only movement possible is in the 
+ X 2 -direction from (1,0). A unit move in this 
direction yields 



The idea of moving from vertex to vertex is 
good for visualization but is poor for higher 
dimensional problems. The same idea, how¬ 
ever, can be implemented algebraically. In 
order to obtain relations which will be re¬ 
quired for solution of LP, define slack vari¬ 
ables Ui,..., so that 

Ax - C-u > 0. (3-12) 

The cost function of Eq. 3-5 will be denoted 
by the variable 


w = B^x. 


(3-13) 


The problem LP now takes the form 
Ax -C-u = 0 \ 


X > 0 

u> 0 


> IP'. 


d/= V/(l,0)dx = - 1 


w = B^x = minimum / 


which decreases /. Move in this direction to The solution of LP' is the same as the solution 

the first vertex (1,1 12), of LP. 
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The information contained in Eqs. 3-12 
and 3-13 is contained in the following matrix 
equation (called the simplex tableau): 



Eq. 3-14 may be viewed as m + 1 equations 
involving the variables JCi,Ui,..., u^, 
w. At present Eq. 3-14 may be interpreted as 
determining m,, and w explicitly in 

terms of Xi,.... x^. It might be desirable to 
determine some other combination of m + 1 
of the variables in terms of the remaining n. 
Except in singular cases, this is possible. 


Assume now that m + 1 of the variables 5i. 
..., , and w have been determined explicitly 

in terms of the remaining n variables Tj ,.... . 

Eq. 3-14 will then take the form 



where primes denote coefficients obtained 
when the original set of equations is solved 
for5i,.... 5^,and w. 

The solution of LP will be constructed 
3-6 


using a method which is based largely on 
Theorem 3-3. 

Theorem 3-3: If in Eq. 3-15 b' > 0, i = 1, 
.... n, and —> 0, / = 1, .... m, then the 
solution of LP is 

r^ = 0,/= 1 ,..., n 

1 ,.... m 

w = 6. 

It is clear from this theorem that any 
method of choosing the variables s. and 
which will terminate with non-negative entries 
in the last row and column, except perhaps 
for 6, will serve as a method of solving LP. 
Before developing such a method, several 
definitions will be helpful. 

Definition 3-5: In Eq. 3-15, the variables 
Sy. / = 1,.... m, are called basic variables, while 
^e variables i = 1,.... n are called nonbasic 
variables. 

Definition 3-6: The set of variables 5 1 ,.... 
•5;,,. ^ 1 » will be called a basic point. If Cy 
< 0, / = 1,..., m, in Eq. 3-15, then the basic 
point will be called a basic feasible point. 

A certain geometric interpretation may 
now be given for the nonbasic variables. In 
LP' it is clear that the boundary of the 
constraint set of LP is obtained by setting 
various combinations of the variables i = 1, 
.... n and Uj, / = 1,.... m, equal to zero. In the 
space /?" of the design variable x, a vertex of 
the polyhedral constraint set is obtained by 
having n equality constraints among the x^. i = 
1, .... n, enforced. By the discussion, this 
occurs when = 0, / =1,.... n. An edge of this 
polyhedron is a line in obtained by setting 

= 0 for n - 1 indices /. From Def. 3-6 and 


( 
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Eq. 3-15, it is clear that a basic feasible point 
corresponds to a vertex of the polyhedral set. 
This is true since setting the nonbasic vari¬ 
ables of the basic feasible point equal to zero 
yields admissible basic variables. Further, two 
vertices lie on the same edge of the constraint 
set if they have n - 1 of their nonbasic 
variables in common. 

The process for interchanging the roles of a 
basic and a nonbasic variable thus becomes 
the central tool for methods based on Theo¬ 
rem 3-3. Suppose it is desired to make a 
nonbasic variable and a basic variable. If 
^ 0 then the /th equation from Eq. 3-15, 

may be solved for to obtain 



Using this expression for may be 

eliminated from the left sides of the remain¬ 
ing equations in Eq. 3-15. For k ^ / this yields 




It is thus clear how the coefficients in Eq. 
3-15 change as the roles of a pair of variables 
are interchanged. This process may be de¬ 
scribed concisely in the language of Definition 
3-7. 

Definition 3-7: The entry fljy # 0, preced¬ 
ing Eq. 3-16, is called the pivot of the 
transformation. The transformation itself is 
called a pivot step. 

The effect of the pivot step on the coef¬ 
ficient matrix of Eq. 3-15 may be illustrated 
easily by the diagram 


ft /V 


’ 1 

Oi “ 

p “ 


P 

P 

B 'v 


P 


7 


7 

LP 

P _ 


The diagram shown by Eq. 3-18 simply relates 
that in the coefficient matrix of Eq. 3-15 the 
following changes occur. The pivot is replaced 
by its inverse. All other elements in the same 
row as the pivot are multiplied by the 
negative inverse of the pivot. All other ele¬ 
ments in the same column as the pivot are 
multiplied by the inverse of the pivot. All 
other elements in the matrix are decreased by 
the product of the element in their column 
and the row of the pivot, the element in their 
row and the column of the pivot and the 
inverse of the pivot. 


3-7 
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Example 3-3: 


Given 

-1 

1 

’"'■r 




3 6 1 

ri 

= 

^2 


^ 5 3 2_ 

.1- 




interchange the role of and ^ 2 . 


Solution: The new matrix relation is 


2/3 - 3 - 14/3“ 


$2 



1/3 -2 - 1/3 


'-2 

= 

ri 

5/3 13 ll/3_ 


_ 1 _ 




It is shown in Ref. 3, page 53, that this 
pivoting transformation preserves the dual 
linear programming problem. 

The pivoting transformation is an organized 
tool which allows one to interchange basic 
and nonbasic variables. It remains only to 
obtain an algorithm which uses this tool and 
Theorem 3-3 to construct the solution of LP. 

a-3 THE SIMPLEX ALGORITHM 

As was shown in par. 3-2, the solution of 
the linear programming problem may be 
reduced to the choice of pivot points. The 
algorithm presented here will have two 
phases. The first phase will consist of an 
algorithm for obtaining a basic feasible point. 
The second phase will operate only with basic 
feasible points and will successively reduce 
the cost function until the hypotheses of 
Theorem 3-3 are satisfied. 

For convenience in the discussion which 
follows, it is assumed that the choice of basic 
and nonbasic variables has been made at a 

3-8 


given stage of the solution process and the 
primes of Eq. 3-15 are dropped, i.e.. 


^11 — ^in 


■'•r 


'Si' 

Om 1 ... Ofn n —Cm 



= 

Sm 

b^ ... b„ 6_ 


1 _ 


W 


(3-19) 


Primes will now be used to denote the 
coefficients that result from a pivot step 
applied to Eq. 3-19. These new coefficients 
are determined by applying Eq. 3-18. 


3-3.1 DETERMINATION OF A BASIC FEA¬ 
SIBLE POINT 


If some elements in the right-hand column 
of the matrix of Eq. 3-19 (other than 6) are 
negative, then the present choice of variables 
is not a basic feasible point. Let - be the 
negative entry nearest the bottom of the 
column (again excluding 6). Since when = 
0, / = 1,..., n, - - Cj^ < 0, if there are 
admissible points in the constraint set of LP, 
then it must be possible to increase 5^ by 
increasing some from zero; i.e., there must 
be some positive Choose /o so that > 
0. This fixes the column index of the pivot. 

To find an admissible row index /q, con¬ 
sider first that after the pivot step 



It is clear then that candidates for the pivot 
must be limited to indices / for which 


Ci 

— > 0. (3-20) 
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With this restriction in mind, consider the 
values of cj after the pivot step with / ^ /q. 


The process described may be given quite 
simply as the iterative i4/gor///im LP-A : 


These are 


-c\ = -Ct* 


/o 


Step 1. Choose — as the lowest negative 
entry (with the exception of 6) in 
the right-hand column of the co¬ 
efficient matrix of Eq. 3-19. 


In order to insure - c\ > 0, / > k, it is 
required that 

_ c. > 0, / > k.i^U . (3-22) 

If > 0 this clearly holds. If < 0, 
however, the requirement, Eq. 3-22, may be 
rewritten as 

(3-23) 

Further, for / = k, 

-c^ ~ * ~^k (3-24) 


Step 2. Choose any positive element 
in the A:th row of the matrix of 
3-19. 

Step 3. Choose as in Eq. 3-25. 

Step 4. Perform the pivot step with pivot 
^*o/o‘ 

Step 5. If any — < 0, / = 1,.... k, choose 

that one with largest index / and 
return to Step 1. If - > 0, / = 1, 

..., m. then a basic feasible solution 
has been found and the process 
may be terminated. 

a-3.2 SOLUTION OF LP 


since fl^y^ 


> 0. 


Inequalities, Eqs. 3-23 and 3-24, show that 
if /q is chosen so that 


^1*0 __ min 

(A 


Vtio 



(3-25) 


then — c*. > 0,i> k and — . If — 

is still negative, the process may be repeated. 
Otherwise choose the next entry above - 
which is negative and repeat the process. 

If all the Cf, i> kzxt nonzero, only a finite 
number of basic points are possible since the 
process is monotone (nonrepeating). If there 
exists a point with — c. > 0, / = 1,..., m, this 
process must find it. The degenerate case in 
which some k is discussed later. 


In par. 3-3.1 an algorithm is given for 
finding a basic feasible point. Once this has 
been accomplished, the object is to find a 
second algorithm which successively reduces 
w. 

Since by Eq. 3-19, w = biTi +... + + 

6, it is clear that if by^ < 0 for some / = 
then w may be reduced by increasing r.^ from 
zero. If a pivot step is performed which makes 
Ty^ a basic variable then w will be decreased. 
The choice of the basic variable s, which is 
to be made nonbasic must be made in such a 
way that the point obtained after the pivot 
step is still a basic feasible point, i.e., so that 
— c. > 0, I = 1, ..., m. However, this is 
precisely the restriction which led to the 
choice of in par. 3-3.1. Therefore, the same 
procedure for choosing may be employed 
here. 
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Since w' = w - the pivot step 

determined here guarantees w' < w provided 
all - > 0, / = 1, .... m. In this case, 

therefore, only a finite number of pivot steps 
may be made, and the process must terminate 
at the solution of the linear programming 
problem. Termination occurs when > 0, / = 
1, n. Theorem 3-3 shows that this is the 
solution of the linear programming problem. 
The degenerate case where some = 0 will be 
discussed par. 3-3.3. 

This process is given explicitly in Algorithm 
LP-B: 

Step 1. Choose any negative entry (except 
6 ) in the bottom row of the 
coefficient matrix of Eq. 3-19. 

Step 2. Choose according to Eq. 3-25 
with k-\. 

Step 3. Perform the pivot step with pivot 

Step 4. If any by < 0, / = 1,.... n, choose 
one by^ < 0 and return to Step 1. 
If by > 0 , / = 1 , .... n, then the 
solution of LP has been found. 

a.3.3 THE DEGENERATE CASE 

In both pars. 3-3.1 and 3-3.2 the computa¬ 
tional algorithms could have problems if some 
c^ = 0. This situation is called degenerate since 
when n constraints are made equalities by 
putting = 0, j - 1 ,.... n, one has 5 ^ = = 0 

which means that still another constraint is an 
equality. The degeneracy arises from the fact 
that in LP the n dimensional design variable x 
= (Xj, ...,x^) satisfies n +1 linear equalities. 

Therefore, the n + 1 equations are not linearly 
independent. 

3-10 


Viewed geometrically, the difficulty occurs 
because the path which successive basic points 
follow on the polygonal constraint boundary 
may form a closed loop. To prevent this 
behavior with only a small error in the final 
solution an entry, - which is zero, is 
replaced by an arbitrarily small parameter e > 

0. The problem is not degenerate any longer 
and cycling cannot occur. Therefore, the 
altered problem will proceed toward the 
solution. 

Example 3-4: Use the simplex algorithm to 
solve the LP 

minimize 2 xi + 9 x 2 + ^3 
subject to 

Xi +4x2+ 2 x 3 ^ ^ 

3x, +X 2 + 2 x 3 >4 

xi > 0 " 

X 2 > 0 

X3 > 0 . 

First, LP' is: 


minimize w where 



subject to 

x^> 0, /•= 1.2,3, 0, /=1,2 

( 
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For the first pivot step in algorithm LP-A, 
A: = 2. Choose = 1 since j = 3 is the 
largest element in the second row. /^ = 2 is 
the only choice available in Eq. 3-25 and 


^2 2 


= j = 4> 0. 


The pivot is ^22 = 1- This pivot step inter¬ 
changes ii 2 and jcj • The result is 


-11 4 -6 ir 




-31-2 4 

li2 

= 


- 25 9 - 17 36_ 

^3 


JV_ 


j. 




Note that this basic point is already a basic 
feasible point so that the process now trans¬ 
fers to algorithm LP-B. Since is most 
negative, choose /q = 1 • Now, 



so /^ = 1 . The pivot is then an = - 11 . The 
result of a pivot step is to interchange x, and 
U|. This results in the basic feasible point 



Choose /q = 3. 


C 2 

- = 11/6,- = 11/4 

"13 fl23 


so lo = 1. The pivot is = -6/11 and a 


pivot step leads to 



- = -11/4,- = 1 , 

ai2 fl22 

so /q = 2 and 022 “- 1/3 is the pivot. A pivot 
step yields 



Since this is a basic feasible point and the 
first three elements in the third row are 
positive, then the solution is immediate. The 
nonbasic variables are zero, 

= JCl = ^2 =0 

and the basic variables take on the value 
JC 3 = 5/2, U 2 = 1, and w = 5/2. 

Therefore, the solution to the original LP is 

jci=0 
X2 = 0 

X 3 = 5/2. 

The minimum value of f(x) attained is 5/2. 

3-11 
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34 MINIMUM WEIGHT TRUSS DESIGN 

As will become apparent in subsequent 
chapters, most optimal design problems are 
nonlinear. Even the problems considered in 
this paragraph appear at first glance to be 
nonlinear. However, it is shown that the 
problem can actually be solved as a linear 
program. This will not be the case in general. 
The class of problems and their solutions that 
are discussed in this paragraph are taken from 
an outstanding paper by Dorn, Gomory, and 
Greenberg (Ref. 5). Similar results have been 
reported more recently (Ref. 6). 

The problem treated here is minimum 
weight design of plane trusses with constraints 
on stress. The initial restrictions on the truss 
include only the location of joints in the 
truss. The loads to be supported by the truss 
are applied at joints. A member with non¬ 
negative cross-sectional area is allowed to 
connect each pair of joints. If there are 
joints, there may be //(/x - l)/2 members in 
the truss. In general, then, statically indeter¬ 
minate trusses are allowed. 

Let / = 1, ..., n, denote the cross- 
sectional area of /th member and the load 
in that member due to the external loads 
applied to the truss; iS^ > 0 denotes tension. If 
m = 2/i, then equilibrium of the joints of the 
truss is specified by the equations 

n 

2 (3-26) 

/ « 1 ' ' 

where are components of applied forces at 
the joints, and are direction cosines of the 
elements of the structure intersecting the /th 
joint. All a.^ are zero if the /th element does 
not intersect the point of application of F^. In 
order to satisfy three equilibrium equations 
for the applied loads (including reactions at 
supports), it is assumed there are m* = m - 3 

3-12 


linearly independent equations in Eq. 3-26. 

If a is the maximum allowable stress (both 
tensile and compressive) for the material from 
which the truss is constructed, then stress 
constraints are 

\S^\< oA^, (3-27) 

Further, if p is the weight density of the 
structural material, the total weight W of the 
truss which is to be minimized is 

n 

(3-28) 

where is the length of the /th member. 

The problem of minimizing W of Eq. 3-28 
subject to the constraints of Eqs. 3-26 and 
3-27 is not the complete truss design problem. 
In addition to the equilibrium conditions of 
Eq. 3-26, a set of compatibility conditions 
between displacements of the joints must be 
satisfied. These compatibility conditions will 
be nonlinear in the variables and A^. In its 
complete formulation, then, the truss design 
problem is not a linear programming problem. 
It will be shown, however, that if the com¬ 
patibility conditions are ignored and the 
problem described by Eqs. 3-26, 3-27, and 
3-28 is solved, its solution satisfies the com¬ 
patibility conditions and is, therefore, the 
solution of the truss design problem. 

Recalling that compatibility relations are 
being ignored, it is required that 

IS) I =aA^, /= l,...,n. (3-29) 

This is true since if IS) | < oA^ for some /, then 
A^ could be reduced with an accompanying 
r^uction in H'. The constraint, Eq. 3-27, is 
therefore replaced by Eq. 3-29. The reader 
should note that this argument would not be 
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valid if compatibility conditions were being 
enforced, since a reduction in some may 
result in a violation of a constraint not 
involving explicitly. 

Since by Eq. 3-29, A^ ” ^1*^/1» 1^® optimiza¬ 
tion problem is now to minimize 


IP', namely, find jc to minimize 
B’^x (3-30) 

subject to 

Ax-C^Q (3-31) 




subject to Eq. 3-26. In order to treat this 
problem as a linear programming problem, 

5^, if 5^ > 0 
0, if5^<0 
0, if5^. > 0 
5^. if5y < 0 

Now, 

s,-s;-sf 

and 

I 5^ I =5/+57 

Denote 


define 






=(S* .57 .....5- ) 

^ I ■” ^ii^m X 2n 


X>0. (3-32) 

This linear programming problem may now 
be solved by the simplex method. Before the 
solution of the linear programming problem 
can be taken as the solution of the truss 
design problem, however, it must be shown 
that it satisfies the compatibility conditions. 
It is clear that if the truss specified by the 
linear programming problem is statically 
determinate, it satisfies the compatibility con¬ 
ditions trivially (i.e., there are no compatibil¬ 
ity conditions). For the analysis here, stat¬ 
ically determinate is taken to mean that the 
member forces Sj are uniquely determined by 
the given loads and the equilibrium conditions 
ofEq. 3-26. 


As pointed out in Ref. 5, page 32, there 
will be m* possibility nonzero components of 
X (basic variables) in the solution, correspond¬ 
ing to linearly independent columns of the 
matrix i4; i.e., only m* of the will possibly 
be nonzero. According to Eq. 3-27, then, only 
m* of the areas may be nonzero. Further, 
since the rank of A is m*, the member forces 
are uniquely determined. The resulting truss 
is, therefore, statically determinate and hence 
is the solution of the original truss design 
problem. 


and 







In this notation, the problem is of the form 


It is pointed out (Ref. 5) that the simplex 
method for solving many member truss design 
problems is relatively time-consuming. It is 
proposed that the method be refined for this 
class of problems to obtain a practical method 
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of solving engineering design problems. Sever¬ 
al examples are solved in considerable detail 
in Ref. 5; the results of one of these problems 
will be discussed here. 


subinterval, the member sizes are different. A 
plot of H' vs a and the forms of optimal 
trusses are shown in Fig. 3-4. 


A bridge truss is to be designed to span two 
points, 1 and 13 of Fig. 3-3. Three vertical 
levels of joints are allowed with five horizon¬ 
tal sets, a total of 15 points, as shown in Fig. 
3-3. In the general case there could be 
15(14)/2 = 105 members in the truss. Loads 
on the floor of the truss are shown in Fig. 3-3. 


ri 



8- 11- 14* 



Figure 3-3, Admissible Joints for Bridge Truss 


In the solution presented in Ref. 5, it is 
assumed that the truss is symmetric about the 
line of joints 7-8-9. This assumption reduces 
the number of variables to 57. Further, due to 
the assumed symmetry, there are only 14 
independent equilibrium conditions. There¬ 
fore, there will be only 14 members which 
can be nonzero in the optimum truss. In the 
solution presented in Ref. 5 the problem is 
made nondimensional by defining a = h/9. and 
where h and fi are the vertical and 
horizontal spacing, respectively, and H and V 
are applied loads shown in Fig. 3-3. 


w 



Figure 3-4, Optimum Bridge Trusses 


The discussion here only touches on the 
highlights of the very complete treatment of 
the truss design problem in Ref. 5. The 
interested reader is encouraged to study this 
outstanding article in detail. 

Before leaving the truss design problem, a 
point of interest in the present results and in 
the results obtained in future chapters may be 
noted. In Fig. 3-4 it is clear that at two values 
of a the form of the optimal truss changes 
form drastically. Still, even though the 
topology of the structure is not continuous in 
a, the weight apparently is a continuous 
function of a. The same sort of behavior 
occurs in a beam design problem with con¬ 
straints on deflection which is discussed in 
par. 7-4. These problems might lead one to 
suspect that there is some basic mathematical 
structure of the optimal structural design 
problem that has not been uncovered. 


The solution presented in Ref. 5, page 45, 
for a fixed value of = 1) shows that there 
are three subintervals of values of a on eadi 
of which the truss has a constant geometrical 
form. For different values of cc within a given 
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3-5 AN APPLICATION OF LINEAR PRO¬ 
GRAMMING TO ANALYSIS 

. A major application of linear programming 
in engineering design is, oddly enough, in 
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nonlinear programming. It is seldom that a 
realistic engineering design problem can be 
formulated as an IP. Realistic problems are 
generally nonlinear when considered as a 
function of both state and design variables. 
Several techniques of solution of nonlinear 
programming problems are based on approxi¬ 
mation of the nonlinear problem by a linear 
one, at least locally. These methods then 
require that the approximating IP be solved. 
This subject will be deferred until a discussion 
of the general theory of nonlinear pro¬ 
gramming has been given. 


A second application of linear pro¬ 
gramming which is of concern to the engineer 
is in the solution of linear boundary-value 
problems that arise in such fields as con¬ 
tinuum mechanics. It should be emphasized 
here that this application is not of an optimal 
design nature, but rather falls in the field of 
engineering analysis. 

One of the important methods of solving 
linear boundary-value problems is to approxi¬ 
mate the solution by a linear combination of 
known functions. The question arises, “How 
should the coefficients be chosen so as to 
obtain the 'best* approximation to the true 
solution?” “Best” may be defined in many 
ways. A relatively new concept of “best” will 
be discussed in this paragraph. 

The general linear boundary-value problem 
may be stated in operator notation as 


tial equations on < jc < 

^ d'z 

L{z] = 2 a^{x) —J (3-35) 

/-o' dx' 

and the boundary operator is 
B[z\ = Az(xi)'¥ Bz{x 2 ). (3-36) 

In the case of partial differential equations. 


I[zJ = 2 a (x) 

la I <m ® 


3 '“ >2 




(3-37) 


and the boundary operator is 

B[zJ = A(x)z (x), X on r. (3-38) 

The method to be discussed treats both the 
partial and ordinary differential equations in 
the same way. Let 0y(x), / = 1,..., k satisfy the 
homogeneous differential equation 

=0,inf2. (3-39) 

Further, let 0q(x) be found such that 

= QM/in^. (3-40) 

Since the operator L is linear, the new 

function 


LU]=G(x), xinD. (3-33) 

B [z] = q(x), X on r (3-34) 

where SI is the domain of the independent 
variable xeR" and F is its boundary. The 
dependent variable is a vector function of x, 
z{x) in R"*. In the case of ordinary differen- 


k 

z=0^+^2^ c^4>f{x) (3-41) 

satisfies the differential Eq. 3-33 regardless of 
the value of the constants c^. The object is 
now to find these constants so that z satisfies 
the boundary conditions of Eq.3-34 as closely 
as possible. 
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Define 

II5 [11 - <7 II = *"*’' 1 1^1 - I • (3-42) 

In this notation, z will be the solution of the 
boundary-value problem if and only if 

WBll] -q(x)\\ = 0 (3-43) 


Example 3~4: Obtain an ai^roximate solu¬ 
tion of 


a^z ^ a^z ^ 1 az 

a^i dxl x^ dx^ 


(3^8) 


inn={(jCj,Xj)| IXj I < 1,0 < < 1 } with 


for all points x on F. 

The method to be treated here attempts to 
minimize the error in Eq. 3-43 at a large 
number of points x®, £ = 1 ,..I, on F. 
Define 


z + —= 0 onF = ((Xj,JC 2 )| = 1 , 

Xj = 0 or Xj = 1 ) (3-49) 

where n is the interior unit normal to F. 


7 = ’"‘"||B[I(;c®)] -q{x^)\\. (3^4) 

The object now is to choose the constants 
so as to minimize y. To see that this is a linear 
programming problem, note that Eq. 3-44 is 
equivalent to 

B,lz(x’‘)] -q,ix‘)< y (3-45) 

and 


Put 



(6 =2 x^ — x^ 

= 8 ArJ -24 jcJ jr* + 3x^. 


V (3-50) 


/ 


-fijz(x*)l +<?,(x')< 7 


(3-46) 


Note that these functions satisfy Eqs. 3-39 
and 3-40. 


for all / and £. 

Note that Eqs. 3-45 and 3-46 are linear in 
the Cy and y. Since the C/ may be either 
positive or negative, it is necessary to define 
new constants > 0 and cr > 0 such that 

= -cj- . (3-47) 

Now, the problem of choosing 7 , cj, cj (all 
non-negative) which satisfy Eqs. 3-45 and 
3-46 and which minimize 7 is clearly a LP. 
Further, it is just a restatement of the best 
approximation criterion of Eq. 3-44. 


The domain n and its boundary F are 
shown in Fig. 3-5. Partial derivatives with 
respect to the interior normal are shown. 



Figure 3-5, Boundary Condition for 
Example 3-4 
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The procedure is now to form 
^ + c, + C 2 ( 2 xJ - x\) 

+ C3 - 24jcJx* + 3jCj) 


Similar inequalities in the c*, and 7 will be 
obtained at all other boundary points chosen. 
Under the requirements c* > 0, cr> 0, and 
7 > 0 , the problem of minimizing 7 is then 
solved. 


and, with the aid of the expressions for bz/bn 
in Fig. 3-5, compute z + bl/bn at L points 
around the boundary F. At a typical point, 
e.g., ( 1 , 1 / 2 ), 


z + 


— 

bn 


16 


+ -(11/4)c3 


Rabinowitz in Ref. 1, page 141, reports 
that an approximate solution obtained by the 
above method is 

Cl =-0.5571, cj = 0.0764, 03 = 0.0024, 
7 = 0.0053. 


-(285/16)c3. 

At this point it is required that 

l/16 + (c| -ci-)-(ll/4)(c;-cp 

- (285/16) (c^3-C3)< 7 


This means that at all the boundary points x®, 
I z + bJ/bn 1 < 0.0053. A result called a 
maximum principle from the theory of 
second-order elliptic partial differential equa¬ 
tions then implies 

I z(x) - z(x) I < 0.0053, X in n 


and 

- 1/16 - ic\ - c;) + (11/4) ic\ - ) 

-(285/16)(c;-C3)<7. 


where z(x) is the true solution of Eqs. 3-48 
and 3-49. This powerful result guarantees that 
the approximate solution z generated by 
linear programming is within 0.0053 of the 
true solution throughout H. 
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CHAPTER 4 

NONLINEAR PROGRAMMING AND FINITE 
DIMENSIONAL OPTIMAL DESIGN 


4-1 INTRODUCTION TO THE THEORY OF 
NONLINEAR PROGRAMMING (NLP) 

As pointed out in the preceding chapter, 
inequality constraints play a central role in 
engineering design problems. The inequalities 
treated in Chapter 3, however, are of a rather 
special form, namely, they involve only linear 
functions of the variables of the problem. It is 
a rare real-world design problem which can be 
put into this form. In general, the inequality 
constraints as well as the cost or return 
function in real-world problems are nonlinear. 
For this reason, a more general theory than 
that presented in Chapter 3 is needed. 

The class of problems considered here is 
called nonlinear programming, or math¬ 
ematical programming. A vast amount of 
literature has been devoted to this class of 
problems in recent years. Several books on 
the subject which contain reviews of this 
literature are Refs. 1, 2, and 3. In view of this 
extensive literature, the purpose of this para¬ 
graph is simply to state the nonlinear pro¬ 
gramming problem and present some key 
results needed in the study of methods of 
optimal design. 

4-1.1 NONLINEAR PROGRAMMING 
PROBLEMS 

For convenience and clarity in the develop¬ 
ment of methods of solution, the nonlinear 
programming problem will be stated in two 
forms. The first form is given by Definition 
4-1. 


Definition 4-1: The first nonlinear pro¬ 
gramming problem NLP, is: find xgR'* to 


minimize fix) 
subject to 
0 


NLP 


where 


gix) = 


gi (^) 

gjx) 


(4-1) 


(4-2) 


Unless otherwise specified, it will be 
assumed that fix) and gix) are continuously 
differentiable. Other than this differen¬ 
tiability requirement, fix) and g(jc) are as 
general as required for a particular problem. 


A second form of nonlinear programming 
problem, which may actually be included in 
NLP, is given by Definition 4-2. 


Definition 4-2: The second nonlinear pro¬ 
gramming problem NLP', is: find xeR" to 


minimize fix) 
subject to 
gix) < 0, 
and 

hix) = 0 


NLP' 


(4-3) 


(4-4) 


(4-5) 
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where 

g(x) = 





and 


hi(x) 


AU) = 



solution with only very mild assumptions. 
This result is a consequence of properties of 
/?'*. In the infinite dimensional optimization 
problems of Chapter 6, the space of variables 
lacks these properties so that no analogous 
result is available. 

Theorem 4-2 provides an easy test for 
closedness of the constraint set. 


Unless otherwise specified, it will be 
assumed that fix), gix), and h{x) are con¬ 
tinuously differentiable. 

Very much as in the linear programming 
problem, the points x which satisfy the 
constraints of NLP and NLP' are charac¬ 
terized by Definition 4-3. 

Definition 4-3: The sets of points xe/?” 
that satisfy the constraints NLP and NLP' are 
called constraint sets. They are denoted 


Theorem 4-2: If the functions g(x) and 
h(x) are continuous, then the sets D and D' 
are closed in . 

The boundedness hypothesis of Theorem 
4-1 may be more difficult to check, par¬ 
ticularly in complex problems. One must 
show that there exists a number a such that if 
xgD or £)', then x^x < a. 

To see that NLP' can actually be included 
in NLP, define 


D = {xe/?"1 g{x)< 0} 


+ „ (.x) = h, (x),/= 1, ...,P 


for NLP, and 


and 


D'= {jce/?" I g(x) < 0andh(x) = 0} 
for NLP'. 


'■=!. P- 

Now, NLP' is equivalent to the NLP: 


For convenience. Theorem 2-2, which was 
stated previously in Chapter 2, is given here 
(Theorem 4-1) as it applies to nonlinear 
programming problems. 


minimize fix) 
subject to 
iix) < 0, 


Theorem 4-1: If fix) is continuous on D 
(Z)') and this set is closed and bounded in , 
then NLP (NLP') has a solution which is an 
absolute minimum of fix) in D (Z)'). 


where g(x) = 




This theorem is one of the most easily This is true since 

obtained yet most powerful results in opti¬ 
mization theory. It guarantees existence of a ^/(•^) < 0, / = m + 1,..., m + 2p 
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is just 

hf(x)< 0, /= 1. p 

and 

0 , /= 

which is equivalent to 

h(x) = 0. 

It should be clear that problems of maxi- 

A 

mizing f{x) are put into the form NLP or 
NLP' simply by defining f{x) = - fix). 
Further, constraints of the form g(x) > 0 are 
transformed to the proper form simply by 
defining g(x) = These transforma¬ 

tions involve no theoretical or practical diffi¬ 
culty. As will be seen in par. 4-2, even though 
the transformation of NLP' into NLP involves 
no theoretical difficulty, severe practical diffi¬ 
culties occur. The explicit characterization of 
equality constraints in NLP' will be useful 
later, when methods of constructing solutions 
are discussed. 

Comparing nonlinear programming prob¬ 
lems with the unconstrained problems of 
Chapter 2, one might conclude that the 
nature of the cost function fix) will deter¬ 
mine the location of the minimum point, with 
only a check required to verify that con¬ 
straints are satisfied. Since the linear pro¬ 
gramming problem is a special case of the 
nonlinear programming problem, the results 
of Chapter 3 show vividly that this conclusion 
is false. In the linear problem, the cost 
function plays only a minor role in the 
simplex algorithm and most of the computa¬ 
tional effort is expended operating on the 
constraint functions. 

While results from the linear programming 
problem yield valuable insight into the non¬ 


linear programming problem, one must be 
careful not to generalize too much. To 
illustrate some differences between linear and 
nonlinear programming, two examples will 
now be treated. 

Example 4-1: 

Minimize 

f(x) = (X, - 3)* + (X 2 - 3)* 
subject to constraints 

- xi<0 

-X2 < 0 

xi +X 2 -4< a 

The constraint set is the shaded triangular 
region in Fig. 4-1. 



If the constraints are ignored, fix) takes on 
its minimum at the point (3,3). Observing the 
circles, which are plots of constant value 
curves of fix), it is clear that the smallest 
value fix) takes on in the shaded triangle is 
fi2,2) = 2. This is, therefore, the solution of 
the problem. 

It should be noted that even though the 
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solution occurred on the boundary of the 
constraint set, it did not occur at a comer as 
it would have if the problem had been linear. 

Example 4-2: 

Minimize 

/(ac) = (x, - 1)* +(x, - D* 
subject to the constraints 

-Xi < 0 

-^2 < 0 
JCi +^2 - 4 < 0. 

The constraint set is just the same as in the 
previous problem. The cost function, how¬ 
ever, has been modified. 

If the constraints are ignored,/(x) takes on 
its minimum at (1,1). Since this combination 
of design variables satisfies the constraints, it 
is the solution of Example 4-2. The solution 
of this nonlinear programming problem, 
therefore, occurs in the interior of the con¬ 
straint set. This behavior contrasts sharply 
with that of linear programming problems 
where the solution must occur on the bound¬ 
ary of the constraint set. 

These examples show conclusively that the 
properties of NLP, and hence, also NLP', 
differ considerably from those of LP. 

Theoretical results and computational 
methods for NLP and NLP' will also be more 
complex than those for the linear pro¬ 
gramming problem. The reason for this is 
clear. Strong use was made of linearity of the 
functions involved in the linear programming 
problem, and this linearity is not present in 
the nonlinear programming problem. The 
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increased complexity of nonlinear as opposed 
to the linear problems is not surprising since 
increased complexity generally accompanies 
this transition in all mathematical disciplines. 

Due to the complexity of NLP and NLP', 
methods of obtaining their solutions are 
generally computational in nature. Moreover, 
in many meaningful engineering problems, 
convergence proofs are not available so the 
designer must depend heavily on his engineer¬ 
ing intuition. One must be extremely careful 
in applying engineering intuition to certain 
aspects of optimization problems, however. In 
most problems of engineering analysis, exis¬ 
tence and uniqueness of solutions are taken 
for granted since these properties hold for 
very general classes of problems such as linear 
elasticity, dynamics, circuit theory, and struc¬ 
tural analysis. Existence and uniqueness ques> 
tions in optimization problems are, however, 
by no means trivial. For instance, before the 
designer commits himself to a design based on 
an optimum obtained by a computational 
algorithm, he should seriously consider the 
possibility that this optimum is only relative 
and an absolute optimum exists that will give 
much better results. 

Due to the weakness of intuition in dealing 
with optimization problems and the inherent 
complexity of these problems, the importance 
of theoretical results concerning existence, 
uniqueness, and necessary and sufficient con¬ 
ditions cannot be overemphasized. The re¬ 
mainder of this paragraph and par. 4-2 are 
devoted to these questions, while pars. 4-3 
through 4-5 contain methods for obtaining 
solution of NLP and NLP'. 

4-1.2 GLOBAL THEORY 

In nonlinear programming problems one 
often obtains a relative minimum of f{x) in 
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the constraint set. The question arises, “Is this 
relative minimum an absolute minimum?” In 
general problems it is difficult to answer this 
question. There is a class of problems, how¬ 
ever, in which this question is easily answered. 
This class is described by Definition 4-4. 

Definition 4-4: If D {D') is a convex set 
and /(jc) is convex on D (/>') then NLP (NLP') 
is called a convex programming problem. 

Theorem A-1, Appendix A, guarantees that 
if gfix), / = 1, m, are convex functions, 
then the set D is convex. Since the equalities 
(Eq. 4-5) in NLP' define a surface in/?'*, it is 
clear that D' is the intersection of that 
surface with the set { xe/?'*lg^(x) < 0, / = 1, 
...» m|. The surface is convex if and only if it 
is a plane, or equivalently, if and only if each 
hjix) is linear in x. Since by Theorem A-6, 
Appendix A, the intersection of two convex 
sets is convex, D' is convex if g^ix), / = 1, ..., 
m, are convex and h^{x)J = 1,...» p are linear. 
The class of problems NLP' which are convex 
is, therefore, quite restricted. 

As will be clear from what follows, con¬ 
vexity is a very desirable property. However, 
in the real world, many optimization prob¬ 
lems are nonconvex. In spite of this fact, the 
study of convex problems is justified. Many 
results which hold only in convex problems 
have led to constructive methods which are 
effective for finding local extrema in noncon¬ 
vex problems. Some of these methods would 
probably never have been developed if only 
general nonconvex problems had been 
treated. 

One of the powerful results which follows 
due to convexity is given in Theorem 4-3. 

Theorem 4-3: A relative minimum in a 
convex programming problem is an absolute 
minimum. 


4-1.3 LOCAL THEORY 

Without convexity it is difficult to say 
much about global properties of the solution 
of NLP or NPL'. Considerable theory is 
available, however, which characterizes local 
minima. The approach in the local theory is 
to suppose that/(x) has a relative minimum at 
a point in Z) or Z>' and then find conditions on 
f(x\ g(x), and h{x) which must hold at this 
point. In this way, many points in D and D' 
may be eliminated as candidates for a relative 
extrema and perhaps relative extrema can 
even be located using these conditions. Such 
conditions, therefore, are called “necessary”. 
In some problems it will be possible to obtain 
a set of conditions that, if satisfied at a point, 
guarantee that this point yields a relative 
extremum. Conditions of this kind, of course, 
are called “sufficient”. 

As often happens in engineering, the engi¬ 
neer needs a powerful result developed in 
mathematics to solve his problem. Proof of 
this result, however, may be very complex 
and, in fact, contribute very little to the 
engineer’s insight into his problems. This 
appears to be the case in many phases of 
optimization theory, in particular, in the 
study of necessary and sufficient conditions 
in nonlinear programming. In the remainder 
of thb paragraph results will be lorrowed 
from mathematical developments. 

Before meaningful results may be given for 
NLP and NLP', the following conditions will 
be required of the constraint functions g(x) 
and h(x). 

Definition 4-5: (First-order constraint 
qualification): Let be a point in the 
constraint set Z)' (or D if there are no equality 
constraints) and let the functions g(jc) and 
h(x) be differentiable at jc°. Then the first- 
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order constraint qualification holds at jc° if 
for any nonzero ye^PT such that < 0 

for each i with = 0 and Vh(x^)y - 0, 

then y is tangent to a differentiable arc 
passing from x® into the constraint set. 

Geometrically, this definition says that if 
the vector y is a direction which, to first 
order, appears to point from x® into the 
constraint set, then there is a curve with y as 
tangent which actually passes from x® into 
the constraint set. The conditions )y < 
0 for g,(x®) = 0 and V/i(jc®) = 0 are just first 
order perturbations of g^Cx) and /i(x) which 
indicate that a small move in the y-direction 
ought to do the right thing to g^(x) and 
h(x). This is illustrated in Fig. 4-2. 



Figure 4-2, First-order Constraint 
Quaiification 


While all constraints do not satisfy the 
first-order constraint qualification, the follow¬ 
ing theorem (Ref. 1, page 19) identifies a class 
of constraints which do. 

Theorem 4-4: If g{x) and h{x) are differ¬ 
entiable at X® in D' and if the gradients 
V^;(x®), for I with g^(x®) = 0, and Vhj(x^) 
are linearly independent, / = 1,..., p, then 
the first-order constraint qualification is satis¬ 
fied. 


In this result, and in fact, in the remainder 
of this paragraph, the problem NLP' is de¬ 
scribed. It is clear, however, that putting p = 0 
in NLP' yields NLP. One of the principal 
results of nonlinear programming may now be 
stated. For proof the reader is referred to Ref. 

1, page 20. 

Theorem 4-5: (Kuhn-Tucker Necessity 
Theorem): Let the functions /(x), g(x), and 
h{x) be differentiable and let the constraint 
functions satisfy the first-order constraint 
qualifications at a point 5c in Z)' of NLP'. In 
order that x be a relative minimum for NLF 
it is necessary that there exist multipliers 
veR'" and such that 

v^ > 0,/ = 1,..., m (4-6) 

v^/(3c) = 0,1 = 1,...» m (4-7) 

and 

Vl(3f.v.w) = 0 (4-8) 

where 

L (x. V. w) = f{x) + v^g(x) + w^h[x) (4-9) 
is called the Lagrangian. 

In a sense. Theorem 4-5 is an existence 
theorem. It asserts that if x yields a relative 
minimum for NLP', then the multipliers v and 
w exist and that Eq. 4-8 is satisfied. Occa¬ 
sionally, one will run across an argument 
attempting to justify this theorem which 
states that 

f(x) = fix) + v^gix) + w^/i(x) = L(x.v.w) 

since v is defined by Eq. 4-7 and /i = 0. It is 
then claimed that since 5c yields a relative 
minimum for /(x) it must yield a relative 
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minimum for I(x,v.w), so VI(x.v.w) = 0 
must hold. This argument is not valid. For a 
rigorous proof of Theorem 4-5 the reader is 
referred to Ref. 1. 

Theorem 4-6 states additional conditions 
which are required to hold if the functions 
appearing in NLP' have two derivatives. 

Theorem 4-6: (Second-order Necessary 
Conditions): Let f{xX g{x), and h(x) have 
two continuous derivatives at a point x in D\ 
Further, let the vectors for all i with 

g^(3c) = 0, and V/i(3c) be linearly independent. 
If X yields a relative minimum for NLP', then 
it is necessary that there exist v and w 
satisfying Eqs. 4-6, 4-7, and 4-8. Further, for 
every yeR" such that Vg^(3c)y = 0 when g^(x) 
= 0, and VhO^y = 0, it is necessary that 

V^L(5F,v.w)y > 0 (4-10) 

For proof of this theorem, see Ref. 1, page 
25. Note that the existence of v and w 
satisfying Eqs. 4-6, 4-7, and 4-9 is a conse¬ 
quence of Theorem 4-5. Even though this 
theorem involves second-order conditions, it 
still gives only necessary conditions. 

A theorem which gives conditions which, if 
satisfied at some point, are sufficient to 
guarantee that this point yields a relative 
minimum for NLP' will now be stated. For 
proof of this theorem, see Ref. 1, page 30. 

Theorem 4-7: (Second-order Sufficient 
Conditions): Let f(x), g(x), and h(x) be twice 
differentiable functions at a point 5c. If for 
xeZ>' there exist v and w satisfying 

Vj> 0 ,1 = 1 ,..., m 

= 1,..., m 

VL(3c.v.w) = 0 


and if for every nonzero yeR” such that 
= 0 for > 0, Vgy(3c)>' < 0 forg/jc) 
= 0 and Vf = 0, and Vh(x)y = 0, it is true that 

y^V^L(5c.v,w)y > 0 (4-11) 

then 3c yields an isolated relative minimum for 
NLP'. 

It should be noted that there is a gap 
between the sufficient conditions of Theorem 
4-7 and the necessary conditions of Theorem 
4-6. Strict inequality is required in Eq. 4-11 
for a larger set of vectors y that may yield 
only equality in Eq. 4-10. It is doubtful that a 
single, tractable set of conditions exist that 
are both necessary and sufficient for the 
general problem NLP'. 

There is one class of nonlinear pro¬ 
gramming problems in which conditions may 
be given that are both necessary and sufficient 
for an absolute extremum. This class is the 
convex programming problem. 

Theorem 4-8: Let fix) and g^(jc), / = 1,..., 
m, be continuously differentiable and convex, 
then necessary and sufHcient conditions for 3c 
to be an absolute minimum point of NLP are 
that there exists veR'" such that 

0 

v^,i^ = 0,i= \ 

V, > 0,/= 1,..., m 

and 

7 /( 3 c )+ 2 v.^gAT) = 0. 

/=l ' ' 

The technical presentation of par. 4-1 ends 
with this satisfying result. Several comments 
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are, however, appropriate at this point. The 
analytic necessary and sufficient conditions of 
par. 4-1 could be used to construct solutions 
of NLP by solving systems of nonlinear 
equations. This is particularly true of the 
results of Theorem 4-8. If one reads the 
current literature, however, he is led to the 
distinct conclusion that iterative methods 
based on successive improvements are too 
effective to bypass in favor of methods that 
require solution of complicated, nonlinear, 
algebraic equations. 

Even if the results of par. 4-1 are never 
used by the designer to construct solutions of 
nonlinear programming problems, they are 
still very powerful tools. Verification of the 
hypotheses of one of the theorems may mean 
the difference between going onto the com¬ 
puter with the comforting knowledge that a 
unique solution exists as opposed to the 
frustrating experience of having computer 
print-out which may be meaningless. 

4-2 THEORY OF FINITE DIMENSIONAL 
OPTIMAL DESIGN 

The nonlinear programming problems of 
par. 4-1 are quite general and may be applied 
to a variety of optimization problems. As is 
frequently the case with very general formula¬ 
tions of problems, special features of some 
problems within the class being studied are 
not exploited. This appears to be the case 
when general nonlinear programming theory 
is applied to solve optimal design problems. 
Interpretation of certain of the variables and 
constraints in the problem NLP', in the 
context of optimal design, yields very effective 
computational methods of solution. This 
paragraph will be devoted to stating the finite 
dimensional optimal design problem, drawing 
an analogy with NLP', and stating necessary 
and sufficient conditions that follow directly 
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from the theorems stated in the preceding 
paragraph. 

4-2.1 FINITE DIMENSIONAL OPTIMAL 
DESIGN PROBLEMS 

The class of problems to be treated in this 
paragraph is, in a sense, a special case of the 
nonlinear programming problem NLP'. How¬ 
ever, by developing a theory for the new class 
of problems which takes advantage of its 
special features, a more efficient solution 
algorithm may be obtained. 

The general optimal design problem must 
have several of the features of NLP'. Namely, 
it is required to have a cost (return) function 
which is to be minimized (maximized) and a 
set of constraints that describe the perfor¬ 
mance demanded of the object being de¬ 
signed. It is in the representation of con¬ 
straints that the optimal design problem 
differs from NLP'. 

In most problems of design in the real- 
world the object being designed is required to 
behave according to some law of physics. This 
behavior is described analytically by a set of 
variables called state variables. Further, there 
is a second set of variables that describe the 
object itself rather than its behavior. These 
variables are called design variables since they 
are to be chosen by the designer so that the 
object being designed performs its required 
function. It generally happens that the laws of 
physics that determine the state variables 
depend on the design variables so the two sets 
of variables are related. 

To illustrate the difference between state 
and design variables, consider the following 
design problems: 

1. Find the coefficient of damping in an 
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automobile shock absorber so that peak ac¬ 
celeration in the passenger compartment due 
to road conditions is as small as possible. 

The coefficient of damping is the design 
variable since it describes the object being 
designed, and its magnitude is to be fixed by 
the designer. Acceleration on the other hand 
is a state variable since it describes the 
behavior of the object being designed. 
Further, this state variable may be determined 
by Newton’s laws of motion. Note that the 
designer has no direct control over the state 
variable. He may effect it only indirectly by 
adjusting the design variable. This is typical of 
state and design variables. 


2. Determine the size of beams to be used 
in a structure so that when a given set of loads 
are applied stresses are within certain given 
limits, the deflection of certain points on the 
structure is within given limits, and the 
structure is as light in weight as possible. 

Beam sizes are the design variables in this 
problem since they describe the structure 
being designed and they must be chosen by 
the designer. Stress and deflection, however, 
are state variables that are determined by 
equilibrium and force deflection relations. 
Again, the designer has no direct control over 
stress and deflection. He may effect these 
quantities only by varying the size of beams 
in the structure. 

In most real-world design problems the 
state and design variables are clearly identi¬ 
fied. In what follows, the state variable will be 
an n-vector, ze/?", and the design variable will 
be a /c-vector, The basic elements of the 
optimal design problem are described by 
Definition 4-6. 

Definition 4-6: The finite dimensional 


optimal design problem (OD) is a problem of 
determining to 


minimize f(z,b) 
subject to 
hiz.b) = 0 
^(z,b) < 0 
where 


(4-12) 


(4-13) 

(4-14) 


h{z,b)^ 


Hz.b) = 


hrizM 


Kiz.b) 


(l>iiz.b) 


<t>^iz.b) 


(4-15) 


and all the functions of the problem are re¬ 
quired to have first-order derivatives. Further, 
it is required that the (n + k) vectors 



(4-16) 


are linearly independent for all i with <t>f{z,b) 
= 0 and that the matrix 


0Z 


(4-17) 


is nonsingular. 


The assumption that the matrix — is 

nonsingular guarantees, by the implicit func¬ 
tion theorem (Ref. 4, page 181), that for 
given b there is a unique solution of Eq. 4-13 
for z. Further, the state variable z, determined 
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from Eq. 4-13 as a function of is differ¬ 
entiable with respect to b. This fact will be 
needed later when constructive methods are 
developed. 


4-2.2 LOCAL THEORY 


Since it is very seldom that the state 
equations (Eq. 4-13) are linear in both z and 
by convexity of the constraint set and hence 
the problem will be rare. For this reason, no 
global results based on convexity will be 
discussed. In case Eq. 4-13 is linear, however, 
global results may be obtained by applying 
the Theorems 4-3 and 4-8. 


It is clear that if a new variable xe/?" ^ is 
defined as 


X = 



(4-18) 


then the problem OD may be put into the 
form NLP'. According to Theorem 4-4, the 
first-order constraint qualification will be 
satisfied for OD (with xe/?'* ^ ^ as indepen¬ 
dent variable) if the row vectors 



for/ with ^f(z,b) = 0 (4-20) 

are linearly independent. Theorem 4-5 may 
now be applied to the problem OD. 

Theorem 4~9: (First-order Necessary Con¬ 
ditions): Let all the functions appearing in 
OD be differentiable at a point F, B which 
satisfies Eqs. 4-13, 4-14, and 4-15. Further, 
let the vectors, (Eqs. 4-19, 4-20, and 4-21) be 
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linearly independent at z^B. Then there exist 
multipliers Xe/?'* and /le/?"*, with m > 0 such 
that for 

H=f{z.b) + \^h(z.b) + tJ<Hz.b) (4-21) 

bH __ 

(z,b) = 0 (4-22) 

dff .- 

— (z,b) = 0 (4-23) 

and 

M;0;(F,b) = O,/=1,..., m. (4-24) 

The proof of this theorem may be con¬ 
structed by simply writing down the 
necessary conditions of Theorem 4-5 in terms 
of X and then separating the components of x 
as in Eq. 4-18. 

In exactly the same way the second-order 
necessary and sufficient conditions of 
Theorems 4-6 and 4-7, respectively, may 
be stated for the problem OD. No essential 
simplification of the statements of those 
theorems occurs, however, so the theorems 
are not restated here. 

Theorem 4-9, just as Theorem 4-5, is 
difficult to use in constructing solutions of 
OD. Considerable difficulty arises because one 
does not know which of the inequalities in 
OD is an equality. For problems with a small 
number of inequality constraints this may not 
be a difficult obstacle, particularly if the 
designer has a good intuitive idea of which 
constraints will be equalities. If, on the other 
hand, there are a large number of inequality 
constraints, then the number of combinations 
of constraints which may be equalities is 
large. It is, therefore, difficult to determine 
just which combinations will be equalities. An 
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analytic solution is extremely difficult in this 
case. 

Rather than attempt to use the necessary 
conditions to construct candidate solutions, a 
more direct approach will be followed. The 
remainder of this chapter will be devoted to 
direct methods of solving NLP, NLP', and 
OD. 

4-3 SEQUENTIALLY UNCONSTRAINED 
MINIMIZATION TECHNIQUES (SUMT) 

A favorite method of solving difHcult 
problems, particularly among mathematicians, 
is to reduce a difficult problem to a sequence 
of easy problems. Each of the easy problems 
is solved and if the method is any good, the 
sequence of solutions of easy problems will 
converge to the solution of the difficult 
problem. As the title might imply, SUMT 
follows just this pattern. It should be clear 
that a central part of this method must be 
results which guarantee convergence, at least 
in cases where solutions are known to exist. 

The method presented here essentially re¬ 
duces NLP and NLP' to a sequence of 
auxiliary problems which may be solved by 
the methods of Chapter 2. The cost function 
of NLP or NLP' is augmented by a function 
called a penalty function. The penalty func¬ 
tion is formed from the constraint functions 
in such a way that as a parameter approaches 
zero (or perhaps infinity) the unconstrained 
minimum of the augmented cost function 
converges to the solution of NLP or NLP'. 
Two basically different ways of doing this are 
presented here. Each has its computational 
and theoretical advantages and disadvantages 
that will be described later. 

Due to the large body of theory concerning 
SUMT, results will be presented in this para¬ 
graph without proof. The reader is referred 
for proofs and an extended discussion of 
SUMT to the complete and well-written text 


of Fiacco and McCormick (Ref. 1). Theoret¬ 
ical results guaranteeing convergence are 
presented here to indicate the level of the 
known theory of SUMT, rather than as a 
complete treatment of the subject. 

4-3.1 INTERIOR METHOD 

The interior SUMT is based on the idea of 
using the constraint functions to erect a 
barrier at the boundary of the constraint set 
D of NLP by adding a penalty function to 
fix) which approaches infinity as the bound¬ 
ary of D is approached from the interior. 
Once the solution of the augmented problem 
is obtained, the penalty function is altered so 
as to effect fix) less in the interior of D, This 
behavior is illustrated in Fig. 4-3. 




Figure 4-3. Penalty Functions 
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As illustrated in Fig. 4-3, when the penalty 
function is decreased on the interior of Z), the 
minimum of the second augmented cost 
function is closer to the solution x than 
the minimum of the first augmented cost 
function x^^ ^ The idea, of course, is that the 
sequence of points x^^^ generated in this way 
converges to x. 

It should be clear why this approach is 
discussed only for NLP and not NLP^ The 
constraint set of NLP' can have no interior 
due to the equality constraints. It is possible 
that NLP has no interior and in this case the 
interior SUMT is not applicable. In what 
follows, it is assumed that the constraint set D 
of NLP has an interior. 

The sequence of points x^^^ which is to 
converge to the minimum point is generated 
by minimizing 

/(x) + 5(r,) /(X) (4-25) 

without regard to constraints, where S(r,.) /(x) 
is continuous for x in the interior of D and 
S{r^) /(x) = + oo for any x such that gj(x) = 0 
for any \ < j < m.\i is clear that if one begins 
an iterative minimization technique of Chap¬ 
ter 2 at a point in the interior of Z), then a 
relative minimum point will be found which 
must lie in the interior of D. Otherwise, the 
minimizing sequence would have had to climb 
over a portion of the auxiliary cost surface 
that is infinitely high and none of the 
methods will do this. 

In order to obtain the sequence of points 
x^^\ the parameter r. is allowed to approach 
zero. To insure that the sequence x^^^ con¬ 
verges to a relative minimum point, the 
functions /(x) and S{r) are required to have 
the following properties: 


1. /(x) is continuous and non-negative on 
the interior of the constraint set D and if 
{ x^ } is any sequence of points in R” 
converging to x where gy(x) = 0 for some /, 
then^*!;;"^ /(x*) = + «>. 

2. S(r) is continuous and if > '’2 > 
then 5(ri)> Sir^) 0 and if is a sequence 
of numbers converging to zero, then Sir^) 
= 0 . 

Probably the most common penalty func¬ 
tions /(x) and Sir) are 

/(x) = - 2 -L (4-26) 

/-I g/M 

and 

5(r) = r. (4-27) 


Any pair of functions satisfying properties 
No. 1 and No. 2 associated with Eq. 4-25, 
however, is suitable. It may be to the 
designer’s advantage to choose another form 
for any particular problem. For other suitable 
choices of penalty functions, see Ref. 1, page 
68 . 

The algorithm for solving NLP by the 
interior point technique is given in Definition 
4-7. 

Definition 4-7: The interior point sequen¬ 
tially unconstrained minimization algorithm is 
given by the following; 

Step 1. Define the function 

U(x.r) = f(x) + Sir) Kx), (4-28) 


4-12 
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where S(r) and /(x) satisfy prop¬ 
erties No. 1 and No. 2. Choose 
> 0 and in the interior of the 
constraint set D, 

Step 2. Beginning at minimize U{xJq) 
without regard to constraints to 
obtain Any of the methods 
of Chapter 2 may be employed for 
this purpose. 

Step 3. For / = 0,1,2,.... choosei > 0 
such that r. ^ ^ < r^. Beginning at 
x^^^ minimize f/(x,r^ ^ j) without 
regard to constraints to obtain 
xC^'*' where i is the iteration in¬ 
dex. 

Step 4. As r^-^oo, if -x^^^ I and 

ly [x(^ ■»* 0]_y [x(0] I are suffi¬ 
ciently small, terminate the process 
and take x^^ ■*" ^ ^ as the solution of 
NLP. Otherwise return to Step 3. 

In order to be sure that this algorithm will 
lead to a solution of NLP, one would like to 
have a result that as 0, a solution is 
approached. Such a result is contained in 
Theorem 4-10. 

Theorem 4-10: In the interior point 
algorithm just given let: 

AxX gjW, ..., g^(x) be con¬ 
tinuous on the constraint set D, (4-29) 

Sir) and /(x) satisfy properties No. 

1 and No. 2, (4-30) 

The interior of D be nonempty, (4-31) 

There be a relative minimum point 

X in D such that /(x) < fix) for all x 

# X in some neighborhood of x. 


where x is not an isolated point of 
A (4-32) 

I } be a strictly decreasing 
sequence which converges to zero. (4-33) 

Then for x<0> sufficiently near x 
and r^ sufficiently small, 

^(/) = j (4.34) 

Further, 

jllf. 5(r,)/[x<')] = 0 (4-35) 

(4-36) 

is monotone decreasing (4-37) 
and 

{/[x^^^ ]} is monotone increasing. (4-38) 


For proof of this theorem see Ref. 1, page 
47. 

It has been noted throughout the previous 
development that if NLP is convex - i.e., 
/(x), gj(x),.. .,g^ (x) are convex - then 
“nice” things happen. One of these “nice” 
things is given in Theorem 4-11. 

Theorem 4-11: If NLP is convex with a 
unique minimum point x, g^ix\ / = 1,..., m, 
are twice continuously differentiable, and if 
Eqs. 4-29 through 4-33 hold, then x^^^ gen¬ 
erated by the given algorithm will converge to 
the minimum point. 

It should be noted that Step 1 of the 
algorithm (Def. 4-7) required a point x^®^ in 

4-13 
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the interior of the constraint set but no 
method of obtaining such a point was given. 
This question will be addressed later in this 
paragraph. 

Example 4-3: Solve the LP 

“*■ ^2 “ niinimum 

= - Xj < 0 

-^2) = - X2 < 0 

using the interior point SUMT. 

Solution: 


C/(x,r) = Xj +X 2 - r 




The functions f{x\ gj(x), and ^ 2 ^ 
convex and by Theorem A-5, Appendix 
A, so are - l/gj(x) and 1/^2^)- Since r > 0, 
U(xj) is convex and thus has a unique 
minimum. To find it, put 



1 - 




^ . n - 1 _ 

9^2 (X2)2 

so 

Xj = ri/2 
X 2 = 

As r 0, Xj -►0 and X 2 -► 0 so the solution 
of Example 4-3 is 

(Xj,X2) = (0,0). 


4-14 


4-3.2 EXTERIOR METHOD 

Unlike the interior method, starting points 
for the exterior SUMT are not required to be 
in the constraint set of NLP. The basic idea in 
the exterior method is to add to the cost 
function a penalty function that is positive 
for points outside the constraint set and zero 
inside the constraint set. This, in effect, 
discourages the minimum of the new 
augmented cost function from being too far 
from the constraint set if the original cost 
function f{x) is “well behaved” outside the 
constraint set. It is clear that this approach 
may not be taken if /(x) is undefined or takes 
on negatively infinite values outside the con¬ 
straint set. One very appealing aspect of the 
exterior method is that it handles equality as 
well as inequality constraints without diffi¬ 
culty, so that it can be used on NLP'. 

The penalty function employed for the 
exterior method will have the form 

P(t) EM (4-39) 

where P(t) and EM are required to satisfy the 
conditions: 

1. £(x) = 0 if X is in the constraint set, and 
£(x) > 0 if X is outside the constraint set. 

2. P{t) is continuous and if ^2 > > 0. 

then £(^ 2 ) > E(t^) > 0. Further, if + 

Probably the most common choice for I\t) 
and EM is 

no = t (4-40) 

and 


f,(x)=^Z [«/x) + |g/x)|)^ (4^1) 
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and 

^2 = 2 lhXx)V 

/*i ' 

where 

£ = £i +£2. 

The basic idea for the exterior method was 
given by R. Courant in 1943 (Ref. 5). He 
argued that if 

(4-42) 

were minimized without regard to constraints 
using ^1 and ^2 with ^2 > since the 

augmented cost function is penalized more 
when t 2 is used than when ti is used, the 
minimum point corresponding to ^2 should 
be closer to the constraint set and hence, 
closer to the minimum point of fix) on the 
constraint set. 

An explicit algorithm for solving NLP or 
NLF by this method is given in Definition 
4-8. 


obtain an unconstrained minimum 
point of 

T(x.t,)=f(x)-^P(t,)Eix) 

denoted 

Step 4. As -►«>, if II [x^^^ - I 

and l/U^^M - I are 

sufficiently small, terminate the 
process and take as the solu¬ 
tion of NLP. Otherwise, return to 
Step 3. 

Very much as in the case of the interior 
method. Theorem 4-12 guarantees a certain 
measure of success. 

Theorem 4-12: In the exterior point al¬ 
gorithm let: 

fix), g (x), (x) be continuous 

for all X. (4-43) 

Eix) and Pit) satisfy conditions No. 

1 and No. 2 of Eq. 4-39. (4-44) 


Definition 4-8: The exterior point sequen¬ 
tially unconstrained minimization algorithm is 
given by the following: 

Step 1. Make an engineering estimate 
of the solution of NLP or NLP'. 

Step 2. Choose t^ > 0 and beginning at 
find an unconstrained mini¬ 
mum point of 

7’(x,^)=/(x)+£(r,)£(x) 

denoted x^^K 

Step 3. Continue with / = 2, - by choosing 
t^ > and starting from 


There be a relative minimum point 
X in that admissible domain D such 
that /(3c) < fix) for all x # 3c in 
some neighborhood of 3c, where 3c is 


not an isolated point of D. 

(4^5) 

The sequence | | is strictly 

creasing to + «>. 

in- 

(4-46) 

Then for x^®^ sufficiently close to x, and t^ 
sufficiently large, 

jrr('> = ? 

(4^7) 

/»(f,)£[x(')] =0 
!-► .0 1 

(4^8) 
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=/(3c) (4-49) 

/-► oo 

Tlx^<\t.]=m (4-50) 

I oo f 

{/U<'>]} is monotone decreasing (4-51) 
is monotone decreasing. (4-52) 

For proof of this theorem, see Ref. 1, page 
57. 

Very much as in the interior method, if the 
NLP or NLP' is convex, then convergence is 
guaranteed by Theorem 4-13. 

Theorem 4-13: If NLP or NLP' is convex 
with a unique minimum point, and if Eqs. 
4-43, 4-44, and 4-46 hold, then regardless of 
the estimates and the sequence 
generated by the algorithm given by Theorem 
4-12 will converge to the minimum point. 

Example 4-4: Solve 

fix i,X 2 ) = Xj + 2x1 = minimum 

hixx,X2) = Xi +^2 - 1=0 
by the exterior point SUMT. 

Tixj) = x\ +2x^ +r(xi +X2 - 1)^ 

dT 

r— = 2xi + 2t(xi +^2 - 1) = 0 

bXi 


-- = 4^2 + 2tiXi +^2 - 1) = 0 

0 X 2 

Subtracting, 

4^2 -2jci =0, orxi =2x2- 

Then 

4-16 


4x2 +2r(3x2 - 1) = 0 
and 

As 

r X2 “^“and Xi 

3 3 

The solution is then 

4-3.3 MIXED INTERIOR-EXTERIOR 
METHOD 

Both the interior and the exterior methods 
presented in pars. 4-3.1 and 4-3.2 are not 
applicable in certain kinds of problems. In 
particular, the interior method cannot be used 
if the interior of the constraint set is empty, 
such as in the case with equality constraints. 
The exterior method cannot be used if some 
constraint function is not defined or is ill- 
behaved outside the constraint. A combina¬ 
tion of the two methods will now be given 
which allows the treatment of problems 
which may have both these undesirable fea¬ 
tures and thus could not be treated by either 
pure interior or exterior methods. 

For convenience, consider NLP' 

minimize/(x) (4-53) 

subject to 

g^ix) < 0, / = 1,..., m (4-54) 

/i^(x) = 0,/= 1,..., p (4-55) 
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where the set of all points which satisfy the m 
inequalities Eq. 4-54, has an interior. As 
might be expected, the constraints, Eq. 4-54, 
will be dealt with using an interior point 
penalty function and the constraints, Eq. 
4-55, will be dealt with using an exterior 
point penalty function. 

The penalty function used here will be 

5(r,)/(x)+/>a,)£(x), 

where S(r)I{x), P{t), and E{x) satisfy condi¬ 
tions No. 1 and No. 2 preceding Eq. 4-26, and 
No. 1 and No. 2 of Eq. 4-39. It is understood 
that /(jc) is a function of only the constraint 
functions in Eq. 4-54 and £(x) is a function 
of only those in Eq. 4-55. A general minimiz¬ 
ing algorithm for NLP is now given in 
Definition 4-9. 

Definition 4-9: The mixed interior-exterior 
sequentially unconstrained minimization al¬ 
gorithm is given by the following: 

Step 1. Make an engineering estimate x^^^ 
of the solution of NLP'. 

Step 2. Choose r, > 0 and > 0 and 
obtain an unconstrained minimum 
of 

K(x.r„r,)=/(x) + 5(r,)/(x) 

+ P(r,)£(x), (4-56) 

denoted 

Step 3. Continue with i = 2,... by choosing 
'■/ < '■/.j and and starting 

from finding an uncon¬ 

strained minimum point of 

=fix) + Sir^) lix) 

+ P(r,)£(x) (4-57) 


denoted 

Step 4. As -► 0 and r -► + «>, if — 
andl/Ix^'^J | are 

sufriciently small, terminate the 
process and take x^^^ as the solu¬ 
tion of NLP'. Otherwise return to 
Step 3. 

As might be expected from a study of the 
two methods which were combined to form 
the mixed method, a convergence result is 
given by Theorem 4-14. 

Theorem 4-14: In the mixed point al¬ 
gorithm let: 

gi(x), ...» g^(x) be continuous on 
the nonempty interior of their con¬ 
straint set and /(x), /ii(x),..., h^{x) 
be continuous for all x. (4-58) 

S(r\ /(x), P{t), and £(x) satisfy 
conditions No. 1 and No. 2 preced¬ 
ing Eq. 4-26, and No. 1 and No. 2 
of Eq. 4-39. (4-59) 

There exist a relative minimum 
point X in the admissible domain D' 
of Eqs. 4-54 and 4-55 combined, 
such that f(x) < / (x) for all x # x 
in some neighborhood of 3c, where 
3c is not an isolated point of D\ (4-60) 

The sequence j } be strictly de¬ 
creasing to 0 and | } be strictly 

increasing to + «>. (4-61) 

Then for x^®^ sufficiently close to 3c. 
sufficiently small, and t^ sufficiently large, 

5(rp/(x<')) =0 (4-62) 
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/»(/,) =0 


(4-63) 


and 

r,,/,] =/(jF). (4-64) 

f -P 0* » • 


For proof, see Ref. Ijpage 60. 
Example 4-5: Solve 
fix 1 ^X 2 ) = -Xi •^X 2 = minimum 
gix 1 ^X 2 ) = -flnx 2 < 0 

h(Xi,X2)-Xi +X 2 -1=0. 


Solution: 

Since ^ 2 ) is unbounded at X 2 = 0, it 
must be treated by the interior method and 
since = 0 prevents the constraint set 

from having an interior, it must be treated by 
the exterior method. From 


Vix,r,t) - -Xi +X 2 + 


inx2 

+ tiXi +X2 - D* 


(4-65) 


^ = - I + 2t(Xi +X2 - 1) = 0 (4-66) 

dx, 




dX2 X2iln^X2 

+ 2r(jCi +X 2 - 1) = 0 (4-67) 

Subtracting Eq. 4-67 from Eq. 4-66, 
r 


X2in^X2 

so 

4-18 


= 2 



Since gix 1 ^X 2 ) < 0 is satisfied at all times, X 2 
> 1. Talcing the limit as r 0, then, JC 2 1. 

As r -►+«>, it is necessary that Xi + X 2 — 1 

0 or Eq. 4-66 will be violated. Therefore, in 
the limit X] = 0. 

The minimum point is, therefore, (JC 1 .X 2 ) = 

( 0 , 1 ). 

4-3.4 DETERMINATION OF AN INTERI¬ 
OR POINT 

In order to begin the interior point or the 
mixed interior-exterior point algorithm, it is 
necessary to have a point x^^^ which satisfies 
a certain set of inequalities, i.e., a point 
interior to a given constraint set. Let this set 
of inequalities be 

gfix)< 0,/= 1,..., m. (4-68) 

If there are other inequalities or equalities 
which will be treated by the exterior point 
method, they are ignored for now. 

Let be a first estimate of an interior 
point of the set defined by Eq. 4-68. Denote 
the inequalities of Eq. 4-68 which are not 
strictly satisfied by 

N= { > 0 } 


and the inequalities which are strictly satisfied 
by 


K= {< 0 }. 

The object now is to move from to 
points where successively more inequalities 
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move from N into K; i.e., so that no con¬ 
straint which was previously satisfied is vio¬ 
lated, but constraints which were previously 
violated are satisfied. This may be accom¬ 
plished by minimizing the unconstrained cost 
function 


U(y.r,)= S g,(y) + S(r,) ^ Ilg,(y)] 

• iGN ' ^ i&K ' 

with respect to y, where S(r) and I(x) satisfy 
the four constraints - No. 1 and No. 2 
preceding Eq. 4-26, and No. 1 and No. 2 of 
Eq. 4-39 - and is a strictly decreasing 
sequence. The result is denoted 

As soon as is such thatg^fy^^] < 0 for 
some I previously in N, that constraint func¬ 
tion is switched to K. In this way, constraint 
functions from N may get to K but those in K 
may never fall back to N. Once all the 
constraints in N are switched to AT, the 
process is stopped and the resulting y^^ is in 
the interior of the constraint set of Eq. 4-68. 
If the minimum of U(y,r^) is found as 0 
and there are still constraints in N, then the 
constraint set defined by Eq. 4-68 has no 
interior. In this case, NLP is infeasible (has no 
solution) or certain of the constraints of Eq. 
4-68 will have to be treated by exterior point 
methods. 


4-4 STEEPEST DESCENT METHODS FOR 
NLP 


derivative information to make successive 
improvements in an estimated solution of 
NLP. A study of the problem NLP' will be 
better included in the next paragraph. 

Geometrically, the method presented here 
will first investigate the direction of most 
rapid decrease in the cost function f{x). As 
seen in par. 2-4, this is - This 

direction is then projected onto the tangent 
hyperplane to the boundary of the constraint 
set at X. A small move in the resulting 
direction will then decrease f{x) and will not 
cause excessive violation of constraints. This 
process is repeated as long as f{x) may be 
decreased. 

Instead of basing the derivation of the 
method on a geometric argument, the work 
will all be done analytically. The reason for 
this is twofold. First, geometric ideas in 
higher dimensions are not always as clear as 
those in two and three dimensions. Second, 
the analytical method used here wiU be 
employed in deriving algorithms in con¬ 
tinuous problems where geometric concepts 
are much more difficult. 


Extensive use will be made of matrix 
calculus notation in this paragraph. 


Recall that for g(x) = 


gi M 
g„(x) 


, xeR" 


In Chapter 2 a gradient method is pre¬ 
sented for finding the minimum of an uncon¬ 
strained function. Such a direct method has 
properties that make it attractive and worth 
developing for the solution of NLP. It is clear, 
however, that due to constraints the gradient 
method studied earlier does not apply directly 
to NLP. It is the object of this paragraph to 
develop a method which uses only first 



Further, the symbol 


6x 


6x 


1 
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will denote a change in x and a 5 in front of a 
quantity which depends on x will denote the 
first order change in that quantity due to the 
change bx in x. For example, for the scalar 
function/(x), 

9/ 

6/U) = ^ 6jc. 
bx 

Note that this first order change is just the 
first term in a Taylor expansion, Ref. 4, page 
84, of f(x\ so bf(x) is an accurate approxima¬ 
tion of the change in f{x) only for small 6x. 

The method to be developed here resem¬ 
bles an interior method in the sense of par. 
4-3. Therefore, the method of generating an 
interior point (one which satisfies all the 
constraints) presented in par. 4-3 may be 
utilized to obtain a starting point. It is 
assumed now that his has been done, and that 
an estimate of the solution of NLP is 
available which satisfies 

gixW) < 0. 

44.1 THE DIRECTION OF STEEPEST DE¬ 
SCENT 

If the point x^®^ is in the interior of the 
constraint set, then the gradient method of 
par. 24 applies and the direction in which 
x^®^ should be altered is 

6x = -At[ x<°>] , (4-69) 

3x 

k > 0. 

In the remaining case, the point x^®^ is on 
the constraint boundary so gjx^^^l = 0 for 
some /. For convenience define the set 

A(x) = { i I g,(x) = 0 } , (4-70) 

4-20 


i.e., the collection of indices of constraint 
functions which are equalities at the point x. 

The object is now to find a direction of 
change 6x, bx^bx = 1, such that 6x = kbx for 
sufficiently small A: > 0 will decrease fix) 
without violating any constraints. The prob¬ 
lem is then to find 5x such that 

if = ^ [x<®>] Sx 
dx 

is minimum subject to 

^ U<°>)6x < 0, 

' 9x 

/€4[X<®>] 

and 

6x^6x= 1. 

For further convenience, define the column 
vector of constraint functions which are zero 
as 

r 

Hx) = 

In this notation the problem is 

minimize -^[x^^^Sx (4-71) 

3x 

subject to 

?^[x«»]6x<0, (4-72) 

3x 

6x^6x = l. (4-73) 

It is assumed that at points where several 
g^(x) = 0, the gradients are linearly indepen- 
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dent. This is enough to satisfy the first-order 
constraint qualification (Theorem 44) for 
the constraint, (Eq. 4-72). Theorem 4-5, there¬ 
fore, may be applied to obtain the necessary 
conditions 




ie4 


With this newg, Eq. 4-74 yields 


V 

dx 


+ 




djc 


+ 2X„6;f^ 


0 , 


(4-74) 



where the components, X, > 0, of X corre- 
q>ond to gf with indices in A ]. 

Assume for the time being that - 0 for 
all Then taking the transpose of 

Eq. 4-74 and premultiplying by dg/dx yields 





(4-76) 


dx bx 


bg bg^ - 

if ar ^ ^ '^0 


/6x = 0, 
bx 


or since dg = 0, 

bx bx bx bx 


Note that > 0 is required since if 
is empty, then Eq. 4-76 must reduce to the 
negative gradient direction. 

Putting 


Since the gradients of for 

are assumed linearly independent, 
the coefficient matrix of X is nonsingular and 


X 


dx 3x / 3x 9x ' 


(4-75) 


Eq. 4-76 becomes 


If all components of X are non-neg¬ 
ative, then the assumption that all 
dgf = 0, is valid and 6jc which solves 

the problems of Eqs. 4-71, 4-72, and 4-73, is 
obtained directly from Eqs. 4-74 and 4-75. 
On the other hand, if X^ < 0 for some 
Iey4(jc^®^], then this component of g is 
removed from g. Equivalently, is 

redefined as 

4[x<“>] = {/| g,[x<0>] =0andX,> 0} 
andg[x<®^) is redefined as 


^ 1 bf^ 

Substituting this into Eq. 4-73, 


Solving for l/2Xjj, 6jc becomes 

d/r 

(4-78) 
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Eq. 4-78 gives a unit vector bx in the 
constrained direction of steepest descent at x 
= The problem is now one of deter¬ 
mining just how big a step should be taken, 
Le., 

dx - kdx (4-79) 

where k> 0 must be chosen. 

Before the problem of step size is treated, 
however, an algorithm may be stated for 
determination of the direction of steepest 
descent, namely: 

Step 1. Using the method of par. 4-3, 
obtain an estimate of the solution 
of NLP, which is in the 

constraint set. 

Step 2. Let / > 0 denote the number of the 
present iteration. Compute 
and form the 
set Alx^^. Compute bf/bxlx^^] 
and bgjbxlx^^ ] for mA [x ^^^). 

Step 3. Compute X in Eq. 4-75. For all X^ 
< 0, delete i from A [jc^^ ] to form 

Step 4. Compute P in Eq. 4-77 and 6jc in 
Eq. 4-78. If P = 0, then this is the 
solution of NLP. 


Example 4-6: Compute the direction of 
steepest descent at the point (2,2) for the 
NLP 

minimize/(jCj,X 2 ) = (^ 2 )^ - x^ 

glix^^X^) = Jfj - ^2 < 0 


^2(^1.^2) = - JC, - *2 + 2 < 0. 

First, 

«,(2,2) = 0 

^ 2 ( 2 , 2 ) = - 2 . 

Therefore, y4[x<*)] = { 1}. As required by 
Step 2 of the Algorithm 

1^(2, 2) = [- 1, 4] 
bx 

^ (2, 2) = [I, - 1) 
dx 

By Step 3, 

sOi4 = A, For Step 4 

'■ra-uK"-" [-;])'■ 


Finally, 
«jc = 



'1/2 l/2'l f- 

1 3/2 

1/2 1 / 2 J L 

U ^ [ 3/2 . 
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4h4^ step size determination 

There are many techniques presented in the 
literature for determining the size of step to 
be taken in the constrained direction of 
steepest descent. Three of these techniques 
are presented here. The first tedinique applies 
to a specialized class of problems in which the 
constraint functions are linear. The second 
and third methods apply to the general 
nonlinear problem. 


jp(/+i) = jp(/) + Iq^x 

if bflbxlx^^'^^^]6x <0. The process in the 
algorithm for the direction of steepest descent 
is repeated from Step 2, and a new step size k 
is computed as above. If, on the other hand, 
bfldx[x^'^^^]dx > 0, then a relative mini¬ 
mum has been bypassed. To locate this 
relative minimum, do a one-dimensional 
search in the direction 5jc starting at x^^ to 
obtain 


44.2.1 ROSEN'S METHOD FOR LINEAR 
CONSTRAINTS 


This process may be summarized in Rosen's 
Algorithm: 


If the constraint functions are linear, then 
once the direction of steepest descent dx is 
found, it may be followed without leaving the 
constraint boundary until a constraint g.(x) - 
0, for I not in Alx^^], This algorithm, 
therefore, can lead to rather long step sizes. 


Step 1. Compute 


min 

k = =o| 




Step 2. Compute 


Constraints here are restricted to the form 

^Ix^f^ + kSx] . 

- dx 

C/x - ft, < 0, i = 1. .... m 

If 

where is an /i x 1 matrix of constants. The 
step size is to be determined so that k is as 

small as possible and still — + k6i ] 6x < 0 


+ k6i] - b, = 0 


put 


for some i^A ]. Only those i need to be 
considered for which G^^dx > 0, since other¬ 
wise this constraint can never go from strict 
inequality to equality. The step size A:, there¬ 
fore, is chosen as 


1 ) =x^/) + kdx 
and go to Step 4. 
steps. If 


k = 


min 




G^bx> 01 
HA Ix</>] I 



^[x</>+Jfc6x]6x> 

dx 


0 , 


then find A: so as to minimize 


The point x^^^ ^ ^ therefore is given by 


+ A:6x]. 
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Put ^ ^ + kbx and go to 

Step 4. 

Step 4. If -/[x<>)]| and||x</^i> 

- are sufficiently small, ter¬ 
minate and take as the 

solution of NLP. Otherwise, return 
to Step 2 of the constrained 
steepest-descent algorithm. 

44.2.2 FIXED STEP WITH VARIABLE 
WEIGHTING 

When the auxiliary problem, Eqs. 4-71 
through 4-73, was formulated, it would have 
been possible to ask for the step size directly 
rather than just the direction of steepest 
descent. In many cases, the behavior of the 
solution is much more sensitive to changes in 
one variable than another. For stability of 
calculation then, rather than asking for a 
direction 6x satisfying Eq. 4-73, the designer 
might request a change 6x in x^^ which 
satisfies 





(4-83) 


Note that if the step size is made large, 
then considerable progress may be made 
toward the minimum point. However, since 
the constraint functions are nonlinear, viola¬ 
tions may occur at any iteration. After a new 
point has been computed, the con¬ 

straint functions should be checked. If any 
constraints are violated in excess of fixed 
tolerance, the method of par. 4-3 may be used 
to move x^^ ^ ^ back into the constraint set. 


6 x^R/6x = 9} (4-80) 

where W is a positive definite matrix (usually 
diagonal) and is a predetermined constant. 
The elements of W are often chosen so that 
expected changes in various components of x. 
5x, will contribute approximately the same 
magnitude to 6x^R^6x. The matrix W, there¬ 
fore, is chosen based on the designer’s ex¬ 
perience. 


The analysis performed in obtaining the 
direction of steepest descent follows with 
only minor changes. The only changes of 
interest, computationally, are 


X 


f bg 

idx 


dx ) 



3 /^ 

dx 


(4-81) 


The computational method is then de¬ 
scribed in Algorithm for Steepest Descent 
With Fixed Step Size: 


Step 1. Using the method of par. 4-3, 
obtain an interior estimate of the 
solution of NLP, x^®\ which is in 
the constraint set. Further, choose 
the weighting matrix W and step 
size fi in Eq. 4-80. 

Step 2. Let / denote the number of the 
present iteration. Compute g^fx^M 
and form the set A [x^^ ]. Compute 
3//3 x(x^M and 3g/3x[x^M for 
i^A [x^^ ]. 

Step 3. Compute X in Eq. 4-81. For all \ 
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< 0, delete i from A ] to form 
Alx^^]. 

Step 4. Compute P in Eq. 4-82 and 6x in 
Eq. 4-83. 

Step 5. Compute x^*^^ = x^^ + 8x. If any 
constraints are violated excessively, 
use the method of par. 4-3 to get 
from x^^^^ back into the con¬ 
straint set. 

Step 6. If \flx^*^ >] JI and \\x ^^^> 

— are sufficiently small, ter¬ 
minate. Otherwise, return to Step 
2 (possibly with altered fi and W), 

44.23 STEEPEST DESCENT WITH CON¬ 
STRAINT TOLERANCES 

In par. 4-4.2.2 it was noted that a step may 
be made so large as to violate a constraint in 
excess of an admissible error. The method of 
choosing step size presented here will prevent 
this difficulty. 

Let reasonable tolerances be assigned to 
constraint functions g^(x). The object here is 
to move in the constrained direction of 
steepest descent until some constraint func¬ 
tion g^(x) violates the tolerance g^(x) > or 
until a minimum of 

flx^) + A:6;tJ 
is reached. 

A uniform step size in k may be chosen and 
steps taken, checking 

for each mAlx^^] and each step in k. The 
multiplier k is increased monotonically pro¬ 


vided flx^^ + kbx] is decreasing, and con¬ 
straints do not exceed the given error toler¬ 
ances. When either fails to hold, the resulting 
point is called x^^ ^ ^. 

If the process is stopped because a con¬ 
straint is violated in excess of its given 
tolerance, the method of par. 4-3 is used to 
obtain a new point in the constraint set and 
the process is repeated until the minimum 
point is located. 

This method should be most effective when 
constraint functions are easily evaluated but 
derivatives are costly in computer time. The 
basic idea of the method is to prevent an 
excessive number of calculations of the con¬ 
strained direction of steepest descent. 

4-4.3 A STEEPEST DESCENT METHOD 
WITH CONSTRAINT ERROR COM¬ 
PENSATION 

In previous subparagraphs, steepest descent 
methods were given which at boundary points 
generated steps parallel to a constraint bound¬ 
ary in a direction which decreased the cost 
function as rapidly as possible. Due to non¬ 
linearity of the constraint functions, and the 
finite step size, however, some constraints will 
invariably be violated. It is the object in this 
paragraph to present a new method motivated 
by the article (Ref. 6) which automatically 
corrects for violation in constraints. 

Let A{x^^] = j 1 1 gflx^^ > 0 } be the 
indices of constraint functions which are zero 
or are violated. As in the preceding develop¬ 
ment of this paragraph first-order Taylor 
approximations will be used to approximate 
functions appearing in NLP. The linearized 
version of NLP at an approximation to the 
solution, is 
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minimizes/= (4-84) 

OJC 

subject to 

^ ] 6x < Ai, (4-85) 

bx 

ku^>n 

where ^ = [_ J and A J is taken as the 
desired change in g, i.e., the total change 
taken at the designer’s discretion. Usually, so 
long as the constraints are not violated ex¬ 
cessively, the full violation may be corrected; 
i.e. 

= (4-86) 

In order that step size is not excessive, it is 
required that 


6 jc^6jc = fi2 (4.87) 

where £ is small. Assuming Eq. 4-85 is an 
equality, necessary conditions for the 
linearized problem are obtained by using 
Theorem 4-5. From 

L= ^Sx + \^ (^^Sx-£^g) 
bx \bx ) 


X. Assuming Eq. 4-85 is an equality, then the 
necessary conditions reduce to only Eq. 4-88 
and Eq. 4-85 as an equality. This system is 
linear and can be solved. The multiplier X can 
then be determined and a check made to see 
whether all components are non-negative. If 
any component is negative, say k, then the 
assumption that Eq. 4-85 is an equality is 
violated and it may be concluded that the A:th 
component of Eq. 4-85 should have been 
allowed to be a strict inequality. The index k 
is then deleted from A . 


Premultiplying Eq. 4-88 by bgibx and using 
Eq. 4-85 yields 


bg bf^ ^ bg bg^ 
dx bx bx bx 


X + 2/38g = 0. 


It is assumed, as usual, that the gradients of 
all constraint functions which are zero or 
violated are linearly independent. Therefore, 
the coefficient matrix of X is nonsingular and 





Substituting Eq. 4-89 into Eq. 4-88 yields 


and Theorem 4-5, it is necessary that X^ > 0, 
and 

bf' bg^ 

^+^X+2/36x= 0 (4-88) 

bx bx 



and 

h(^ =0. 

This set of equations is nonlinear in X and 


(4-90) 

This expression for bx could now be substi¬ 
tuted into Eq. 4-87 to find p. To be more 
general, however, put 1 / (2j5) = 7 > 0 and define 
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6x^ = 



Using this new notation Eq. 4-90 becomes 

dx = 76 jc^ + dx^. (4-93) 

This representation of 6x has important prop¬ 
erties given by Theorem 4-15. 

Theorem 4-15: and 6x^ of Eqs. 4-91 

and 4-92 satisfy the conditions 

1 . 6x^ = 0 

2. ^ 6x^ = Ag 

bx * 

3. =0 
dx 

4. ^ 6 jc^ < 0 
dx 

A method of choosing y still has not been 
given. This parameter is interpreted as a 
step-size and may be determined by one¬ 
dimensional search or any other scheme 
chosen by the designer. In different applica¬ 
tions, different methods have proved effec¬ 
tive. No single scheme has been found that 
seems best. The choice of 7 at this time 
constitutes an art as much as a science. 

The use of this method may now be 


summarized in the Steepest-descent Al¬ 
gorithm With Constraint Error Compensation. 

Step 1. Make an engineering estimate of 
the solution of NLP'. 

Step 2. Let the iteration number be / > 0. 
Compute and form 

A[x^^] andg. 

Step 3. Compute dg/djc[jc^^M and 

dfldx[x^^] and choose the de¬ 
sired change Ag in g. 

Step 4. Compute dx^ and 6x^ in Eqs. 4-91 
and 4-92. 

Step 5. Choose 7 by a suitable scheme. 
Calculate X in Eq. 4-89. If any 
components X^ are less than zero 
for gJx^M which are close to 
zero, remove these components 
from g and return to Step 3. If all 
X^ > 0 , proceed. 

Step 6 . Form 

dx = 76 ^^ + dx^ 
and 

+ dx. 

Step 7. If -flx^^] 1 and || dx || 

are sufficiently small, terminate 
the process. Otherwise return to 
Step 2. 

4-5 STEEPEST DESCENT SOLUTION OF 
THE FINITE DIMENSIONAL OPTIMAL 
DESIGN PROBLEM 

In this paragraph a steepest-descent method 
of solution of the problem OD is developed. 
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In many ways, the method of this paragraph 
is similar to the method of par. 4-4. Here, 
however, a distinction is made between design 
and state variables, and the two types of 
variables are treated quite differently. 

The problem to be solved here is, just as in 
par. 4-2: Choose beR^ and z€/?'*to minimize 

f(z. b) (4-94) 

subject to 

hiz,b) = 0 OD (4-95) 

and 

«(r. b) < 0 (4-96) 

where h(z,b) = [h^iz,b\ ..., /i^(z,b)l^, and 
0 (z,b) = [^i(z.b), ..., 0^(z.b)]^. The state 
equations, Eq. 4-95, are put into vector form 
here in order to take advantage of the 
compact matrix calculus notation. 

The steepest descent algorithm for OD is 
developed here by first approximating the 
nonlinear elements of OD by linear expres¬ 
sions in the various variables. The difference 
between the method presented here and that 
of par. 4-4 lies in the treatment of the state 
variable. In a sense, the state variable is a 
nuisance since it does not really describe the 
system being designed. The algorithm pre¬ 
sented here is obtained by first eliminating 
the state variable from the linearized problem 
and then solving an explicit problem for an 
optimum improvement in the design variable. 

Very much as in par. 4-4, an engineering 
estimate of the optimum design is made. It is 
denoted by b^^K Then the state equations, 
Eq. 4-95, are solved for the corresponding 
state z^®^ Any method of analysis may be 
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used to solve Eq. 4-95 for z. The object now 
is to determine a change in denoted 6b, 
such that 

^( 1 ) = ^( 0 ) + 5 ^ ( 4 . 97 ) 

will be an “improved** design. The meaning of 
“improved** will be made clear as the analysis 
progresses. If the new design variable b^^^ 
were substituted into Eq. 4-95, this equation 
could be solved for the corresponding new 
state variable z^^K Since the matrix 
db/dz[z^o>,b<®>] is nonsingular, the implicit 
function theorem. Ref. 4, page 181, guaran¬ 
tees that if II db II is small, then z^^^ — z^®^ 
will be small. The change in z is denoted 6z so 
that 

^(1) = ^(0) + ^2, (4-98) 

4-5.1 AN APPROXIMATION OF THE 
PROBLEM OD 

The basic idea in the approach to OD 
presented here is to construct an approxima¬ 
tion of OD which can be solved to obtain an 
improvement 6b in The approximate 
problem is obtained by making linear approxi¬ 
mations to nonlinear functions in OD. Linear 
approximations to the changes in /(z.b), 
hiz,b)y and ^jiz,b) due to the small changes 
6b in and 6z in z^®^ are, by Taylor*s 
Formula, Ref. 7, page 56: 

(4-99) 

+ K[z(0) ft(0)]5ft 

ob 

(4-100) 

3 [.<«>, »<«>]« 
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and 


/>«>] 

(4-101) 

+ |^rz(0), 1,10)] sb. 

00 ^ 

In the development that follows, the argu- 
ments of all functions will be 

understood unless otherwise explicitly noted. 
The symbol 6 in front of a quantity simply 
denotes the total differential of that quantity. 


Since 0 and z^®^ + 6z is to 

satisfy the equation /i(z^®^ + 6z,h^®^+ 6!>] = 
0 , the linearized version of this condition is 
simply 

dh dh 

— 6z’^—8b^0, (4-102) 

3z db 

Eq. 4-102 is viewed as determining 6z as a 
function of db. It is clear that Eq. 4-102 can 
be solved for 6z since the matrix bh/dz has 
been assumed nonsingular. 

Inequality constraints, Eq. 4-96, will be 
treated in an approximate manner. The 
method employed here is to require that if 
> 0, then 

< A4y, (4-103) 

where A4y is the required change in the value 

of 0, due to the changes 6z and db in z^®^ and 
^( 0 ) 

For convenience of notation, define the set 
of indices 

A = |/| > o| . (4-104) 


The set A (possibly empty) simply contains 
all the indices / of constraints that will be 
required to satisfy Eq. 4-103. To make 
maximum use of vector calculus notation, 
define the column matrix 



(4-105) 


If the set i4 is empty, then ^ is defined as 
zero; i.e., all the constraint functions whose 
indices are in i4 are placed in a column 
matrix. In this way, the conditions, Eq. 
4-103, may now be written. 


—oz + —oft < A0, 
dz dft 


(4-106) 


where the column matrix A^ is defined as 



(4-107) 


If i4 is empty, A^ is defined to be zero. 

The object of the following analysis will be 
to choose 6ft so that/lz^®^ + 6z.ft^®^ + 6ft] is 
as small as possible. If this nonlinear function 
of dz and 6ft is replaced by its Taylor 
approximation, the problem is to choose dz 
and 6ft to minimize 

bf bf 

df^^dz’^^db, (4-108) 

dz dft 

The entire argument up to this point has 
been based on the fact that || 6ft || will be 
small. In order to insure that this is the case, 
it will be required that 

db^Wdb<i^ (4-109) 
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for ( small and W a positive definite matrix. 
The matrix W will be used in particular 
problems to assign weights to the various 
components of 6b. This is often necessary 
when the components of b represent different 
physical quantities that may be of different 
orders of magnitude. Usually W is diagonal. 

To summarize the approximate problem, 
Sb and 6z are to be chosen to minimize 


bz bb 


subject to the constraints 


c ^ 

— 62 + — 6b = 0, 

dz bb 


X 4. Xfc aJU 

— 6z + — 6b <; A0 
bz bb 


and 

6b^W6b< 


(4-110) 


(4-111) 

(4-112) 


(4-113) 


4-5.2 SOLUTION OF THE APPROXIMATE 
PROBLEM 

Necessary conditions of Theorem 4-9 could 
now be applied directly to the approximate 
problem, Eqs. 4-110 through 4-113. If this 
course of action is followed, however, an 
explicit inverse of bh/bz must be computed. 
Since the dimension n of this matrix is often 
quite high, this operation would be very 
costly. Instead of applying necessary condi¬ 
tions immediately, Eq. 4-111 will be used to 
eliminate the dependence of the remaining 
functions of the problem on 6z. Necessary 
conditions may then be easily applied for the 
determination of 6b. 

The obvious method of eliminating depen- 
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dence on 6z is to solve Eq. 4-111 for 6z as a 
function of 6b. This, however, requires the 
inversion of the matrix bh/bz. The preceding 
approach of applying necessary conditions 
was scuttled for just this reason, so another 
method of eliminating 6z must be found. 
Note that if the terms (bf/bz) 6z and 
(b^/bz) 6z could be found in terms of 6b, 
then dependence on 6z would be eliminated. 
This is the approach that will be taken here 
and also in a later chapter on infinite dimen¬ 
sional problems. 

Define the column matrix as the solu¬ 
tion of 



dz 


(4-114) 


and the matrix X^ as the solution of 


bh^ - bf^ 

"ar- 


(4-115) 


Note that X^ is a matrix whose columns are 
solutions of 


bh^ ^ a»y^ 
bz bz 


(4-116) 


for j^A. Note that Eqs. 4-114 and 4-115 
require the repeated solution of equations 
with the same matrix on the left and different 
right-hand sides. There are efficient computa¬ 
tion codes which can construct all the solu¬ 
tions simultaneously. 

To see how these newly defined matrices 
are helpful, compute the transpose of both 
sides of Eqs. 4-114 and 4-115 and multiply 
through on the right by 6z to obtain 

(4-117) 

bz bz 
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and 


9 T hh 

^ = (4-118) 

oz oz 

Note that the terms on the right side of these 
equations are exactly the ones which are to be 
eliminated from Eqs. 4-110 and 4-112. Fur¬ 
ther, the term {hh/hz) bz that appears in both 
left-hand sides can be obtained from Eq. 
4-111 as 


OZ bb 


Using this relation, Eqs. 4-117 and 4-118 
become 


bb bz 


and 


xT bh bS 

—&b=^Sz. 
bb bz 


Substituting these relations into Eqs. 4-110 
and 4-112, the approximate problem be¬ 
comes: bb is to be chosen to minimize 


bb 

subject to the constraints 


Sb< Af 
bb^Wbb < 
where 


bb bb ^ 


(4-119) 


(4-120) 

(4-121) 

(4-122) 


and 


„4 _ 

“~ — X^, if i4 is not empty 

Od Ob 


= 0, if i4 is empty. 


(4-123) 


It should be noted that if the limitation, 
Eq. 4-121, on the size of || II is not 
enforced, then the problem, Eqs. 4-119 and 
4-120, is just a linear programming problem 
that may be solved by well-established tech¬ 
niques of linear programming. This technique 
is similar to that used in Zoutendyk’s method 
of feasible directions (Ref. 8). For a discus¬ 
sion of this method the reader is referred to 
the literature. 

The necessary conditions of Theorem 4-9 
may now be applied to this reduced problem. 
In order to apply the theorem and in later 
calculations, it is required that the matrix 9^^ 
have full row rank; i.e., that the rows of 9^^ 
(columns of are linearly independent. 
Further, for use of the theorem it is required 
that the column vector Wbb be linearly 
independent of the columns of It may be 
noted that these assumptions require that 
there can be no more than k - 1 constraint 
functions which are zero or positive at any 
iteration. This is true since the matrix 9^ has 
only k rows and since its columns must be 
linearly independent of there can be at 
most k - 1 remaining linearly independent 
columns. These assumptions are reasonable 
from a physical point of view. If 9^ had rank 
k then the equation 


would uniquely determine bb, and there 
would be no optimization problem. 


The constraints, Eqs. 4-120 and 4-121, will 
be treated differently, so different multiplier 
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notation in Theorem 4-9 will be used for 
each. First, deHne 

H = + bb-\-vhb^Wbb. 

Theorem 4-9 requires that 

hH T T 

=0 = 8-' + 2^66^H' (4-124) 

bob 

where> 0and p> 0 

- A4>,) = 0, ieA (4-125) 

and 

v(bb^Wbb-^^) = 0. (4-126) 

At this point, a computational difficulty 
arises. It is difficult to determine db from 
Eqs. 4-124,4-125, and 4-126 since it is not 
known which of the constraints, Eqs. 4-120 
and 4-121, will be equalities and which will be 
strict inequalities. The question is, “Which of 
the inequalities, Eq. 4-120 or Eq. 4-121, will 
become strict inequalities?” This can be inter¬ 
preted geometrically as a question of leaving 
the boundary and going into the interior of 
the constraint set defmed by Eqs. 4-120 and 
4-121. It has been the experience with this 
technique that once a constraint, say 0y(z, b), 
becomes zero, then for several small steps db 
it will remain zero. This observation has led to 
the following computational procedure. First, 
all constraints, Eqs. 4-120 and 4-121, will be 
assumed equalities and db is determined using 
Eqs. 4-124, 4-120, and 4-121. Then the 
algebraic signs of the and p are checked. If 
they are all non-negative, then this is the 
desired solution of the problem. If, on the 
other hand, some p^ or p are negative, then 
the constraints corresponding to these multi¬ 
pliers are removed from Eq. 4-120 or Eq. 
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4-121 and the problem is again solved with 
the reduced number of constraints. 

In any method of solution of the approxi¬ 
mate problem, no information is gained if = 
0. Therefore, in the following > 0 will be 
assumed. 

Solving Eq. 4-124 for db, 

bb = -:^W-'(t^ + (4-127) 

It is now assumed (to be checked later) that 
Eq. 4-120 is an equality. Substituting for db 
from Eq. 4-127 into the equality Eq. 4-120, 

- ^ 8* ^ H'-* (8-' + 8*A) = 

Rewriting this equation, 

£« ^ H/-I A = - 8^ ^ H'-* 8-' - 2»' 

Since is required to have full row rank 
and H'"* is nonsingular, the matrix 

1 , if i4 is empty 

^ (4-128) 

H'"* £^, if i4 is not empty 

is nonsingular. Therefore, 

A = H'-' 8-' + 2v A^). (4-129) 

Note that in the unconstrained case when A is 
empty, /z = 0 since £^ ^ = 0 and = 0. 

Substituting from Eq. 4-129 into Eq. 4-127 

+ (4-130) 
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This expression for db could now be 
substituted into db^Wbb = to solve for v. 
However, in practice it seems just as realistic 
to choose p > 0 in an iterative process as to 
choose Once p > 0 has been chosen ix may 
be evaluated in Eq. 4-129. If any components 
are n^ative, the corresponding elements of ^ 
are removed and db is calculated using the 
new f matrix. 

To aid in interpreting the meaning of terms 
in Eq. 4-130 for 6b, define 

6b^ (4-131) 

and 

66 * = H'-' A# (4-132) 

In this notation, 

(4-133) 


The vector 6b* may be interpreted as a 
constrained gradient with l/2i^ taken as a step 
size. The matrix which multiplies in Eq. 
4-131 essentially projects the gradient of 
the cost function onto a tangent plane to the 
constraint set. The term 6b^ serves to drive 
any errors in constraint functions to zero. 
These interpretations are supported by Theo¬ 
rem 4-16. 

Theorem 4-16: The vectors 6b* and 6b^ of 
Eqs. 4-131 and 4-132 have the following 
properties: 

1. 66*^M'66' =0 

2 . 8 *^ 66 ' =0 


3. 8*^66* =A^ 

4. -8-'^66‘ < 0. 

An obvious check on convergence is to 
monitor 6b and the associated reduction in 
6 /. When small 6b occur and essentially no 
improvement is made in /, the process is 
terminated. This test, however, leaves a great 
deal to be desired since the choice of p can 
yield very small steps 6b and falsely lead the 
designer to believe that the iterative process is 
converging. 

A much better test is to monitor the 
constrained gradient 6b*. Since in an uncon¬ 
strained problem the gradient must approach 
zero at a minimum, one might expect that 
once = 0, the constrained gradient 6b* 
should approach zero. The real quantity 
||6b*|| could then serve as a convergence 
check. Theorem 4-17 makes these ideas more 
rigorous. 

Theorem 4-17: Let b), h(zjb\ and 
b) be continuously differentiable functions. If 
the sequences [b^M and [z^M generated by 
the above algorithm converge to the solution, 
z, b of the problem OD and if ^ = 0 for all 
sufficiently large /, then it is necessary that 
6 b* approaches zero as j approaches <». 


4-5.3 STEEPEST DESCENT ALGORITHM 

The iterative procedure developed in this 
paragraph may be summarized as follows: 

Step 1. Make an engineering estimate of 
the optimum design variable, b^®\ 

Step 2. In the /th iteration, j > 0, solve Eq. 
4-95 for z^^^ corresponding to b^\ 
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Step 3. Form the vector of constraint 
functions ^ in Eq. 4-105 and solve 
Eqs. 4-114 and 4-115 for and 
X^. 

Step 4. Compute and in Eqs. 4-122 
and 4-123. 

Step 5. Choose in Eq. 4-107. 

Step 6. Compute in Eq. 4-108. 

Step 7. Compute and hb^ in Eqs. 
4-131 and Eq. 4-132. 

Step 8. Choose p > 0 and evaluate /i in Eq. 
4-129. If any components of are 
negative, take the corresponding 
elements out of f and return to 
Step 3. 

Step 9. Compute 

b<J*i) = b^) -J- Sb' + 66 *. 

2v 

Step 10. If |/0cy*‘>] - /[xy>]l and 
II bb^ II are sufficiently small, ter¬ 
minate. Otherwise, return to Step 
2 . 

4-5.4 USE OF THE COMPUTATIONAL AL¬ 
GORITHM 


the designer should be familiar with the 
method of obtaining the given algorithm. In 
this way, problems with peculiar features 
often can be treated by altering the general 
algorithm slightly. 

There are two steps in the algorithm of par. 
4-5.3 which are not complete. They are Steps 
8 and 10. In Step 8, a parameter p is to be 
chosen, but no analytical method of choosing 
it is given. This is the classical difficulty with 
steepest-descent methods. They give a direc¬ 
tion but, unfortunately, they do not allow 
analytical determination of a step size (l/(2i') 
in this case). 

A simple technique for choosing v which 
has worked well in a number of problems is 
given here as a candidate scheme. Since it is 
the fib* component of fib which tends to 
reduce /, the step size determination will be 
based on fib*. The basic idea is to choose p in 
order to obtain a certain percentage reduction 
in /. Let A/ (a negative quantity) be the 
desired reduction in / for a single iteration 
(perhaps a 5% to 10% reduction). Since for 
A# «0, 

(4-134) 

2p 

p is chosen as 


The algorithm presented in par. 4-5.3 will 
certainly not solve all optimization problems. 
It is presented primarily to guide the designer 
to the proper equations developed in par. 4-5 
while he is solving a problem. Almost surely a 
complicated real-world optimal design prob¬ 
lem will have some feature which is not 
explicitly contained in the general formula¬ 
tion OD. In order to utilize a steepest-descent 
philosophy similar to the one developed here, 
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p 


£-^^fib* 
2A/ • 


(4-134) 


In many problems p has been chosen accord¬ 
ing to Eq. 4-134 on the first iteration and 
held constant throughout the iterative pro¬ 
cess. In other problems convergence prop¬ 
erties were improved if is changed during 
the iterative process. 
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CHAPTER 5 

FINITE DIMENSIONAL OPTIMAL STRUCTURAL DESIGN 


5-1 INTRODUCTION 

Throughout this handbook, structural 
optimization problems are chosen to illustrate 
the use of the design methods developed. 
There are two principal reasons for using 
structural problems for illustration. First, 
there has been great emphasis on helicopter 
and man portability of materiel, which places 
a premium on structural weight. Illustrative of 
Army concern with lightweight structures is 
the theme of the 1970 Army Mechanics 
Conference, “Lightweight Structures” (Ref. 
33). 

A second key reason for highlighting struc¬ 
tural optimization is its advanced state of 
development, relative to other areas of the 
mechanical engineering sciences such as dy¬ 
namics of machinery and mechanisms. A few 
examples in these related areas are treated in 
this handbook, but development of com¬ 
putational techniques remains to be done. It 
is felt that if the reader develops a thorough 
understanding of structural optimization and 
computational techniques, he will be in a 
good position to address problems outside the 
realm of structures. The fact that the math¬ 
ematics of structural analysis parallels that of 
related mechanical disciplines strengthens this 
feeling. 

A cursory review of Army materiel needs 
convinces one that light weight is a require¬ 
ment for a majority of weapon systems being 
developed by the Army. The high priority 
placed on air mobility as well as lightweight 


infantry equipment has presented weapon 
system designers with a major challenge. In 
the case of air mobility, minimum equipment 
weight is a necessary condition for maximum 
helicopter payload. In infantry applications 
equipment weight limits the soldier’s fire¬ 
power and mobility. 

In seeking lightweight designs, one is 
tempted to simply use lightweight materials 
and lower safety factors. It becomes apparent, 
however, that structural weight reduction can 
significantly degrade system performance. For 
example, when the weight of an artillery piece 
is reduced by 30%, dynamic response due to 
firing the weapon becomes much more severe. 
In infantry weapons, the requirement of 
reduced weight has led designers to lighter 
weight operating mechanisms for individual 
weapons. In lightweight rifles, for example, 
bolts are much lighter than in previous weap¬ 
ons and hence are more sensitive to changes in 
friction due to dust and external particles 
than were the more masi^ive bolts in t^e M14 
and Ml Rifles. There are many e\amples, 
some of which will be discussed later in this 
handbook, of instances in which simply re¬ 
ducing weight of subsystems causes problems 
which did not occur in heavier designs. 

The lightweight objective, then, requires 
that the developer take an overall system view 
and consider the interaction between weapon 
weight and performance of the weapon sys¬ 
tem. As is true in virtually every design 
problem in which the limits of technology are 
approached, the lightweight weapon design 
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problem must be considered simultaneously 
with all other aspects of system design. It is 
not practical to expect, therefore, that one 
will find lightweight structural design special¬ 
ists operating independently of designers con¬ 
cerned with other aspects of the weapon 
development. A technology is needed which 
will allow trade-offs concerning weapon- 
weight to be integrated into the overall 
weapon design process. 

The objective in this treatment has been to 
formulate the minimum weight structural 
design problem with constraints realistically 
reflecting the performance requirements of 
the weapon system. A detailed formulation 
and solution of this structural design problem 
is presented in this Chapter as well as Chap¬ 
ters 7 and 9. 

5-1.1 LIGHTWEIGHT VS STRUCTURAL 
PERFORMANCE TRADE-OFFS 

Normally, achieving a lightweight structure 
requires a reduction in the amount of material 
used. The consequence is an increase in 
structural flexibility that causes increased 
deflections, decreased natural frequencies, 
and decreased buckling loads. Consequently, 
failure modes that were not previously criti¬ 
cal, may now become limiting factors in 
design. For example, in gun supporting struc¬ 
tures, increased deflection often reduces 
effectiveness of the weapon system by in¬ 
creasing dispersion. There are many ways in 
which such changes in structural performance 
can have an impact on overall system 
behavior. 

The only effective approach to minimum 
weight structural design is to formulate the 
structural design problem to include con¬ 
straints on performance which are dictated by 
functional use of the weapon system. As a 


result, the minimum weight design problem is 
often stated with explicit constraints on 
structural deflection, natural frequency, buck¬ 
ling load, and strength. A central part of the 
design problem, then, is representation of 
weapon system performance requirements 
that have an impact on structural design. It is 
often required that in doing structural design, 
dynamic weapon performance must be ana¬ 
lyzed to assure that the proper constraints are 
included in the structural design problem. 

5-1.2 WEAPON DEVELOPMENT PROB¬ 
LEMS ASSOCIATED WITH LIGHT¬ 
WEIGHT REQUIREMENTS 

To further explore some of the trade-offs 
between lightweight and Weapon system per¬ 
formance, several typical problems en¬ 
countered in weapon development will be 
discussed in this paragraph. The discussion 
here is presented to highlight some typical 
problems, not necessarily to identify all light¬ 
weight structural design problems faced in 
weapon development. 

5-1.2.1 AIRCRAFT ARMAMENT 

Some of the most critical lightweight struc¬ 
tural development problems in weaponry 
today are in the field of aircraft armament. 
This is due to the very high priority placed by 
the Army on improved air nubility and the 
need for minimum weight weapon systems to 
be carried by helicopters. The combination of 
lightweight structural requirements and the 
extreme environment under which the struc- 
tiue must perform in helicopter application, 
generates a very difficult class of minimum 
weight structural design problems. The weap¬ 
on developer’s interest in structural design for 
aircraft armament lies primarily in the area of 
weapon and weapon support structures. 
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The key structural requirement in this 
application is accurate aiming of an automatic 
weapon during firing. Dynamic response of 
the weapon support structure due to inputs 
from the weapon and from the airframe, 
which vibrates due to aerodynamic inputs, 
must be considered in the design problem. 
The most difficult feature of the minimum 
weight structural design problem for aircraft 
weapon applications is the variety of per¬ 
formance and failure constraints which must 
be treated in the design process. Constraints 
must generally be placed on stresses arising in 
the structure, angular deflection of the struc¬ 
ture at the gun mount, and natural frequency 
of the supporting structure. These constraints 
generally appear in the form of inequalities. 
For example, stress is required to be less than 
or equal to the allowable stress for the 
material. This kind of constraint is very 
realistic, from an engineering point of view, 
but makes the solution of the optimal design 
problem rather difficult. 

In addition to altering the geometry and 
distribution of material in the structure to 
obtain desirable performance, it is also possi¬ 
ble to induce damping into the structure and 
to use active feedback control devices to 
reduce response. These two methods of re¬ 
ducing dynamic response will require addi¬ 
tional weight on board the helicopter. There 
is a trade-off between design of the structure 
and design of other means of obtaining 
improved weapon system performance. These 
trade-offs, then, require that we treat the 
aircraft weapon design problem as a system 
problem, explicitly accounting for the inter¬ 
action between structural behavior, damping, 
and active feedback control. 

5-1.2J2 GUN BARREL DESIGN 

A second area in which lightweight struc¬ 
tural design is of critical importance is that of 


gun barrel development, particularly for 
infantry automatic weapon application. With 
a great deal of emphasis being placed on 
lightweight infantry weapons, the barrel is a 
natural component in which to seek weight 
reduction. This is particularly true for rapid 
fire weapons in which heavy barrels have 
traditionally been used to alleviate tempera¬ 
ture problems. For a particular barrel con¬ 
figuration, decreased mass tends to cause 
elevated temperatures and stresses. To com¬ 
plicate matters, material strengths are highly 
temperature dependent, making stress con¬ 
straints difficult to handle. Another potential 
problem, as one tends toward optimality in 
barrel design, is the possibility that material 
yield properties will become critically depen¬ 
dent upon strain rates and require their 
explicit inclusion in the design process. 

Another problem, which can arise in re¬ 
duced weight design, is barrel deflection with 
resulting reduction in weapon accuracy. 
Deflection constraints must, therefore, be 
considered. 

The objective of the barrel design problem 
is to choose barrel dimensions and structural 
material to minimize barrel weight in the 
presence of constraints on dollar cost, tem¬ 
perature, stress, and perhaps strain rate. The 
optimal design problem must then include 
equations of state of stress and temperature as 
a function of time, both depending on the 
barrel design features. 

5-1.2.3 TOWED ARTILLERY 

The principal objective in towed artillery 
design is to provide support for a large-caliber 
tube that will, upon firing, transmit momen¬ 
tum to the earth without doing damage to the 
support structure and without undue dynamic 
response. The fundamentals of the design 
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problem then lie in the field of mechanics 
and, in particular, are highly dependent upon 
the weight distribution within the artillery 
piece. 

In traditional artillery design, the support 
structure is flexible but has been quite heavy 
and stiff in the past so that the flexibility of 
the structure was a higher order effect. Also, 
heavier carriages reduced the severity of the 
dynamic response problem due to their higher 
mass. Recent developments, such as the 
Ml02, 105 mm Howitzer, have resulted in a 
weapon that weighs approximately 3200 lb, 
as compared to the older Ml01 which 
weighed 4500 lb. As a result of the reduced 
weight, problems have arisen in providing a 
firm support for the artillery piece on soil. 
More recent design efforts, including the 
XM164, 105 mm Howitzer, and XM198, 155 
mm Howitzer, have resulted in weapons 
which are considerably lighter than their 
predecessors. As a result of the reduced 
structural weight of the weapon, dynamic 
response in both of these weapons became 
critical and had to be treated as a key design 
constraint in development of the recoil mech¬ 
anism. For a discussion of a particular prob¬ 
lem, the reader is referred to the artillery 
design example of par. 8-5. 

Although these are primarily mechanical 
system design problems, they have arisen due 
to the lightweight design criterion. For this 
reason, when one considers lightweight struc¬ 
tural design he must be willing to fit his 
structural design problem into a larger system 
design program and clearly understand the 
interfaces arising between structural and other 
system performance characteristics. 

5-1.2.4 OTHER WEAPON PROBLEMS 

The example problems cited in par. 5-1.2 
are meant to illustrate the essential features of 
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some of the more complex lightweight struc¬ 
tural design problems faced in weapon devel¬ 
opment. They are simplifications of the real 
problems but are difficult enough to illustrate 
the need for research in development of 
design methods. In view of the current em¬ 
phasis within the Army on air mobility and 
lightweight systems, new design methods are 
required which are capable of solving these 
and many more lightweight design problems. 

5-1.3 PLAN FOR TECHNIQUE DEVELOP¬ 
MENT 

The remainder of this chapter will be 
devoted to formulation and application of a 
method of structural optimization. As noted 
at the beginning of par. 5-1, an in-depth 
treatment of lightweight structural design 
provides insight into application of the gen¬ 
eral methods of Chapter 4. 

For a comprehensive review of structural 
optimization through 1967, the reader is 
referred to Refs. 1 and 2. Several of the major 
classes of optimal structural design problems 
are outlined in Ref. 2. Some of the key papers 
which have appeared in the literature ^ce 
1967 are listed in Refs. 3 through 18. 

5-2 ELEMENTS OF THE ELASTIC STRUC¬ 
TURAL DESIGN PROBLEM 

A class of optimal structural design prob¬ 
lems in which the structure must remain 
elastic is treated in this paragraph. The objec¬ 
tive of this paragraph is to show how the 
optimization methods of Chapter 4 can be 
used to solve realistic optimal design prob¬ 
lems. No attempt is made here to present a 
complete theory of optimal structural design 
that is capable of solving all problems. 

The reader should note that, even for the 
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class of problems considered here, it is not 
possible to blindly apply the techniques of 
Chapter 4. A certain amount of knowledge of 
structural analysis is required before a reason¬ 
able statement of the design problem and a 
method of solution can be obtained. Even 
more important, the structural designer needs 
to have a thorough knowledge of the opti¬ 
mization methods of Chapter 4 and their 
development. As will be seen, in some cases it 
is required that parts of the design problem be 
interpreted in light of the derivation of the 
optimization method. In this way the method 
may be adapted for solution of a particular 
class of problems. 

6-2.1 THE OPTIMALITY CRITERION 

The meaning of optimal or best must be 
clearly established in each problem of inter¬ 
est. In order to have a problem which may be 
solved by the previously developed optimiza¬ 
tion methods, a real valued measure of the 
cost of the structure (value of the structure) 
must be chosen. Such measures as dollar cost 
of the structure, weight of the structure, or 
dynamic response of the structure may be 
chosen. 

Along with the choice of a cost function, 
the parameters, or design variables, that repre¬ 
sent all design alternatives must be chosen. 
These parameters will often be dimensions of 
structural members, area of member cross 
sections, or locations of joints in the struc¬ 
ture. In keeping with the notation of the 
preceding chapter, these design variables will 
be denoted as , / = 1,..., m. For con¬ 
venience of notation, these variables will be 
put in the vector form b = [bi . . b^ )^. 

Invariably, the behavior of the structure 
under load will have to be considered in the 
design problem. The response of the structure 
may include quantities such as stress, displace¬ 


ment, buckling loads, and natural frequency. 
The collection of all variables required to 
describe this response due to applied load will 
be denoted by the state variable vector z. The 
manner in which z is related to the design 
variables and applied loads will be discussed in 
some detail later in this paragraph. 

The cost of the structure must now be 
described as a real valued function of the 
design and behavior variables. In keeping with 
the preceding notation this function will be 
denoted as 

j ^ nz.b.i) ( 5 - 1 ) 

where is one or more eigenvalues such as 
buckling load and natural frequency. Before a 
meaningful discussion of treatment of the 
structural design problem may be given, the 
behavior of the structure due to loads and 
constraints on that behavior must be ana¬ 
lyzed. 

5-2J2 STRESS AND DISPLACEMENT DUE 
TO STATIC LOADING 

It is assumed for now that the structure of 
interest is either made up of a finite number 
of distinct interconnected members or that 
large continuous members in the structure 
have been approximated by a finite number 
of elements as in finite element techniques. 
Further, it is assumed that the entire structure 
is described by a vector design variable b. 

Let stresses at critical points in the struc¬ 
ture be denoted Z] > . •, z^ and displacements 
required for the analysis and design of the 
structure be denoted z^^j,..., z^. The 
behavior of the structure due to any given 
load may then be specified by the vector state 
variable z = [zj,..z„ ] ^. Attention will 
be restricted here and in the remainder of this 
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chapter to structures which obey Hook’s law, 
i.e., stress and displacement are determined 
by linear equations. It is clear, however, that 
the design variables play a large part in the 
response of the structure to loads. The depen¬ 
dence on the design variables enters these 
linear equations through the coefficients. The 
equations for z will be denoted 

A(b)z^P (5-2) 

where Z’ is a matrix of loads and 

A(b)^[a^^mn^ n ( 5 - 3 ) 

is a matrix whose elements depend on the 
design variables. 

In this formulation of the problem, z and P 
may be generalized state and load variables. 
Eq. 5-2 may be obtained through direct 
application of equilibrium and compatibility 
conditions or through application of varia¬ 
tional criterion for equilibrium. In today’s 
structural analysis technology, Eqs. 5-2 are 
very likely to be obtained by finite element 
methods (Refs. 19, 20). If the structural 
analysis problem is properly formulated, the 
matrix A(b) is nonsingular and z may be 
obtained by solving Eq. 5-2. It is assumed that 
the elements of the matrix A{b) are differ¬ 
entiable with respect to b. 

In most real-world structural design prob¬ 
lems the structure is required to carry a whole 
family of loads that occur at different times 
in the life of the structure. The treatment 
here will be limited to a finite number of 
loads, denoted/’^ / = 1,..., s. Associated 
with each load is a state z^ determined by Eq. 
5-2. 

^Constraints on behavior of the structure 
due to each of the applied loads P may 
include bounds on stresses and displacements. 
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These constraints can generally be written in 
the form 

0(2, b, f) < 0 (5-4) 

where . . 

The inequality constraints, Eq. 5-4, are 
required to be satisfied for each of the states 
z^ due to different applied loads P^. 

It is clear that the Eqs. 5-2 and constraints, 
Eqs. 5-4, fit into the formulation of the finite 
dimensional optimal design problem of par. 

4- 5. Treatment of the restrictions imposed by 
Eq. 5-4, however, must be delayed until 
similar restrictions due to other behavior 
constraints are accounted for. The entire 
problem will be treated in par. 5-3. 

5- 2.3 NATURAL FREQUENCY AND 

BUCKLING 

As pointed out in par. 5-1, the desire to 
obtain lightweight structures has led to 
resonance problems and, likewise, buckling 
problems. It is necessary, then, that a mean¬ 
ingful optimal design methodology be capable 
of enforcing constraints on eigenvalues 
associated with the system response. The sort 
of constraint considered here is 

f > fo (5-5) 

where i is buckling load or natural frequency 
and is a lower bound on that eigenvalue. 
More general restrictions than those of Eq. 
5-5 are included in the general constraint, Eq. 
5-4. 

Much as in Eq. 5-2, the equations of 
vibration or buckling may be written in the 
form 

K{b)y^!;M(b)y (5-6) 
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where y = [j'l,.. .,y„ 1^ is an eigenvector 
which plays the role of a state variable, 

/:(*) = IV6)]„X» (5-7) 

is generally symmetric positive definite ma¬ 
trix, and 

= [rn^^m „ „ (5-8) 

is generally also a symmetric positive definite 
matrix. Eq. 5-6 is often obtained through 
a finite element formulation of the structural 
analysis problem (Refs. 19, 20). 

There are many methods for obtaining the 
eigenvalue and associated eigenvector in Eq. 
5-6. The first method requires that the inverse 
of K{b) be computed. Multiplying through 
Eq. 5-6 by ^-^(6), 

K-^(b)M{b)y=^ y (5-9) 

This problem is now in standard form and the 
largest eigenvalue of K~^(b)M(Jb) is sought. 
The power method of obtaining this eigen¬ 
value is quite effective (Ref. 21). It is par¬ 
ticularly effective when a good estimate of 
the eigenvector is available. In the iterative 
design technique, a good estimate is generally 
available from the previous iteration. The 
power method is, therefore, well suited for 
use in iterative techniques. This method does 
have the severe disadvantage that K~ ^ {b) 
must be computed for each new b. 

A different method of fmding the smallest 
eigenvalue and associated eigenvector of Eq. 
5-6 without computing K'' ^ (6) is based on 
the Rayleigh quotient as discussed in par. 2-8 
and Ref. 23. The smallest eigenvalue of Eq. 
5-6 is obtained by choosing a normalized 
vector y which minimizes the quotient 

K(b)yl[y^M(b)y]. The minimum value 


of this quotient is the smallest eigenvalue. A 
direct method of minimizing the Rayleigh 
quotient is discussed in par. 2-8. 

5-2.4 METHOD OF SOLUTION 

In the preceding formulation of the opti¬ 
mal design problem, the cost functions and 
constraints associated with stress and displace¬ 
ment can be put into the format of the 
problem treated in par. 4-5. The constraints 
associated with natural frequency and buck¬ 
ling, however, are not of exactly the same 
form. One difficulty is that the coefficient 
matrix for the eigenvector y, K(b) - iM(b), 
must be singular at the solution. This clearly 
contradicts the assumption in par. 4-5 that 
the state equations uniquely determine the 
state variable. 

This situation is a direct result of Murphy’s 
law “if anything can go wrong it will’’. 
Actually, it is not realistic to expect that a 
mathematical formulation of the kind pre¬ 
sented in par. 4-5 should contain aU real- 
world design problems. Already, an important 
problem has been encountered which requires 
an understanding of the development of par. 

4- 5 in order to include the new problem in 
the steepest-descent algorithm. The eigenvalue 
problem, fortunately, can be treated very 
nicely by the steepest-descent technique. 
Elevelopment of the method will be done in 
par. 5-3. 

5- 3 STEEPEST DESCENT PROGRAMMING 

FOR OPTIMAL STRUCTURAL DE¬ 
SIGN 

In order to obtain a steepest-descent al¬ 
gorithm for the design problem with con¬ 
straints on eigenvalues, it is necessary to go 
back into the derivation of the algorithm of 
par. 4-5. The major effort required here will 
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be the linearization of the structural design 
problem to obtain an approximate problem of 
the kind described by Eqs. 4-119 through 

4- 121. 

5- 3.1 LINEARIZED COST AND CON¬ 

STRAINT FUNCTIONS 





(5-15) 


for each > 0. All this follows since h(z,b) in 
the general formulation is simply A(f))z in the 
present problem so 


Since the cost and constraint functions 
depend on z, Z>, and the first order 
perturbation in these functions due to small 
changes 6z, bb, and b^ in z. b, and is 


db 


- if 


and 


+ + (5-10) 

dz db db 

and 

m d0 d0 d^ 

if 

The problem of writing the perturbed cost 
and constraint functions explicitly in terms of 
bb now reduces to obtaining explicit ex¬ 
pressions for the terms involving 6z and bj;. 



with A symmetric so A^ = A. 

Thus, the explicit dependence of Eqs. 5-10 
and 5-11 on bz can be easily eliminated. It 
remains to determine b( in terms of bb. This 
problem has been addressed in a completely 
rigorous manner by Kato (Ref. 23). Explicit 
expressions are given there under quite restric¬ 
tive hypotheses. A formal development will 
be given here which obtains the same result. 


From Eqs. 4-117 and 4-118, and the 
perturbed state equation we obtain, just as 
Eq.4-119, 

= - X-'’' ^ t/l(6)zl«i» (5-12) 

dz db 

and 


It is assumed that the eigenvalues and 
eigenvectors of 

Mb)y = !:AfWy (5-16) 

depend continuously on b and further, that to 
first order, the following perturbation equa¬ 
tion is accurate 


^Sz = -\*^lA(b)z]5b (5-13) 

dz db 

where and are determined by 

A\'=~ (5-14) 

dz 


m)by+ ^ lK(b)y]Sb = 8mb)y 

Ob 

+ ^mb)y]bb + mb)8y 


(5-17) 


where y and f satisfy Eq. 5-16. 


and 


If K{b) or Mib) is not symmetric, it is 
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necessary to solve the adjoint eigenvalue 
problem 

K^{b)y = iM^(b)y (5-18) 

that has the same eigenvalue f as Eq. 5-16 but 
a different eigenvector y. Rearranging and 
premultiplying by y ^ this is 

y ^(^(*) - mb)]6y+y^ bb 

00 

[mb)ym 

Ob 

= y’^mib)y 

Since the first term is a scalar, 

[A:(6) - liMm by^by"^ (A:’'(6) 

-iM^my. 

Since y is an eigenvector of Eq. 5-18, 

[K'^(b)-m^my^o 

and this equation becomes 

{ y'^ ^ - fpr A [M(b)y] j bb 

= biy'^M{b)y. 

Assuming y ^ M(Jb)y ^ 0 which will generally 
be the case, 

{ y'^ Y^^[K{b)y] 

(5-19) 

- iy"^ ^ [M{b)y] I bb^[y^M{b)y]. 

Derivation of the perturbation formula, Eq. 
5-19, has been strictly formal. The assump¬ 


tion that Eq. 5-17 holds is highly questionable 
from an operator theoretic point of view. 
Under reasonable assumptions on the finite 
dimensional eigenvalue problem treated here 
however, Eq. 5-19 is shown to hold (Ref. 23); 
Le., even though the justification given here is 
not mathematically rigorous, the result, Eq. 
5-19, holds for a large class of problems. 

Defming 



X ~I [y'^M{b)y ], 

or 

0, if ^ is empty. 

Eqs. 5-10 and 5-11 become 

bJ = i''^bb (5-22) 

and 

b^ = ^^bb. (5-23) 


[M{b)y] 


r^-) 


(5-21) 
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The linearized problem is now to minimize 
dJ, £q. 5>22, subject to constraints 

where Aji is the desired correction in con¬ 
straint error and 

6b^ W6b < 

where W is positive defmite and ( is small. 
This is precisely the same problem in par. 4-5, 
Eqs. 4-119 through 4-121, so the theoretical 
results and steepest-descent algorithm of that 
paragraph apply with proper interpretation. 

5-3.2 STEEPEST DESCENT ALGORITHM 
FOR OPTIMAL STRUCTURAL DE¬ 
SIGN 

Step 1. Make an engineering estimate of 
the optimum design variable b^^^. 

Step 2. For / = 0, 1, ..., solve Eq. 5-2 for 
Eq. 5-6 for and and 
Eq. 5-9 for y (if kib) or Mib) is 
not symmetric) with b = b^^. 

Step 3. Form ^ as in Eq. 4-105. Solve Eqs. 
5-14 and 5-15 for and X^. 

Step 4. Compute 9^ and in Eqs. 5-20 
and 5-21. 

Step 5. Choose Af as the desired reduction 
in constraint error. 


- + 2vAf). If any 

component of m is negative, re¬ 
move the corre^onding row from 
f and return to Step 3. 

Step 8. Compute 

(/- C* )«■' 
and 

and form 

2 » 

Step 9. Compute 

Step 10. If all constraints are satisfied and 
66* is sufficiently small, termi¬ 
nate. Otherwise, return to Step 2 
and continue the process. 

All the properties of 66* and 66^ derived 
in par. 4-5.2 hold in this case. Further, the 
discussion of that paragraph regarding such 
things as choosing p also hold. The reader 
should refer to that paragraph for detailed 
discussions. 

5-3.3 COMPUTATIONAL 
CONSIDERATIONS 


Step 6. Compute 




1, if ^ is empty 
T 

H'"* , elsewhere. 


Step 7. Choose p > 0 and evaluate /x = 
5-10 


Several comments on the computational art 
used in solution of these problems are in 
order. First, if a feasible design was chosen 
initially, large steps could be taken until one 
or more constraints were vicriated, at which 
time the step size was reduced. Second, it was 
noted that as the optimum was approached. 
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oscillation occuired. By monitoring the dot 
product, ^ bb^~ oscillations were 

sensed when negative values of the dot prod* 
uct occurred. Thus, step size, l/(2t'), was 
divided by two when negative values of the 
dot product occurred on two successive itera¬ 
tions. Finally, the most effective method of 
adjusting step size was to monitor successive 
reductions in cost function after feasibility 
had occurred. Once insignificant reductions 
occurred, the step size was reduced to obtain 
finer convergence. 

The Power method used to compute the 
smallest eigenvalue performs quite well. At 
every iteration, the starting value for the 
eigenvector is taken from the previous itera¬ 
tion which manifested a very rapid rate of 
convergence. An accuracy of 0.1% in each 
component of the eigenvector was used to 
compute the new eigenvector. The stiffness 
matrix for the structure was inverted by the 
Gauss-Jordan elimination procedure. 

Another conunent that is appropriate here 
concerns the sign check on the Lagrange 
multiplier vector /i, called for in Step 7 of the 
computational algorithm (par. 5-3.2). The 
algebraic sign of each component of the 
Langrange multiple vector /i was checked at 
each iteration. If some of the components 
were negative, then the matrix and the 
vector Af were adjusted accordingly. This 
procedure is particularly useful whenever 
there were redundant constraint violations. In 
some cases, the number of constraints vio¬ 
lated is more than the number of design 
variables of the problem, yielding a singular 
matrix coefficient of /i- In such cases numeri¬ 
cal noise yielded a solution such that some of 
the components of the vector were always 
negative, indicating that the corresponding 
constraints would be strictly satisfied in the 
next iteration. In numerical examples, the 


number of constraints with positive compo¬ 
nents of M was always less than or equal to the 
number of design variables of the problem. 
This procedure of adjusting the constraint set 
has worked very well and has minimized the 
possibility of divergence of the algorithm. 

The method presented is relatively auto¬ 
matic in the sense that, for the computer 
program developed, the input data given is the 
only pertinent design information required 
for solution of the problem. All the necessary 
matrices and their derivatives are automati¬ 
cally generated in the computer. Any person 
with a reasonable knowledge of FORTRAN 
language should be able to handle the pro¬ 
gramming without any difficulty. The method 
is developed to meet simultaneously di^lace- 
ment, strength, and frequency requirements 
on the structure. The technique, therefore, 
can be made user oriented. 

5-4 OPTIMIZATION OF SPECIAL 
PURPOSE STRUCTURES 

Several special purpose structural optimiza¬ 
tion problems are solved in this paragraph on 
an ad-hoc basis to iUustrate the method of 
par. 5-3. Subsequent paragraphs will treat 
large scale problems in a more unified man¬ 
ner. 

5-4.1 A MINIMUM WEIGHT COLUMN 

A column is to be constructed by making 
its cross section piecewise uniform as shown 
in Fig. 5-1. The objective of the design 
problem is to choose the element areas so that 
the column will support a vertical load Pq 
without buckling or yielding under compres¬ 
sive load. For the purpose of the present 
problem the geometric shape of each column 
element is fixed and symmetric about two 
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Figure &1. Column 


orthogonal axes so that the cross-sectional 
area of the ith element completely specifies 
the element. With this assumption, if a is the 
second moment of the cross section of unit 
area, then 

(5-24) 


where is the allowable stress of the 

column material in compression. 

In order to apply the optimization method 
of par. 5-3, the equations which determine 
the buckling load in terms ofb=lbi ,.... 1 ^ 

must be obtained. Using the generalized co¬ 
ordinates shown in Fig. 5-2 and Eq. B-4, 
Appendix B, the potential energy of the /th 
element under the buckling load P is 

PE, = ^ K'ibW - P u*^D‘(b)u* (5-28) 
where 

«'= Mj. «i]^ 

is as shown in Fig. 5-2. The matrices K*(b) 
and D‘(b) are from Eqs. B-4 and B- 8 , Appen¬ 
dix B 


In this problem, weight of the column is to 
be minimized so that the cost function is 

J = y i b,L, (5-25) 

where 7 is material density and is the 
length of the /th element of the column. 

There are two basic constraints that must 
be satisfied in this design problem. First, to 
insure that the buckling load P is not less than 
the applied load Fq, it is required that 



X 


i 



Figure Column Element 


0. (5-26) 


12 

-6L, 

- 12 

-6L, 

Second, in order to insure that the column 
material does not yield under the applied load 

Eabj 
K‘(b)^ - i- 

-eii 

AL] 

eL^ 

2L] 

Fq, it is necessary that 


-12 

6L, 

12 

U, 

= < 0. 

(5-27) 

1 = 1 .....At 

and 



7L, 

AL] 
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(5-30) 


Summing the total potential energies of all 
the elements from Eq. 5-28 and defining a 
new variable 


y=[yi.yi. -,y2k]^ 


(5-33) 

since does not depend on P, / = 1. k and 

/(a 


o** + l _ 


db 


(Arca)^) y 



= [Uj, «J, uj, Mj, .... u*. u*]^ 

the total potential energy PE of the coliunn 
may be written 

PE =^~yT - P^y^ D(b)y 

where K(b) and Dib) are made up of elements 
of K^ib) and D^(b) and are symmetric. Apply¬ 
ing the theorem of minimum total potential 
energy given in Appendix B, the governing 
equations of buckling are 


The computations required in Eq. 5-34 are 
messy but they can be programmed for 
automatic computation. 

All expressions required for direct applica¬ 
tion of the steepest descent algorithm of par. 
5-3 are now available. Numerical results and 
profiles of optimum columns are shown in 
Tables 5-1 and 5-2, and Fig. 5-3. Numerical 
data for the example problems are £ = 3.0 x 
10^ psi, a = 0.079577, = 20,000 psi, 

and I = 10.0 in. Computation in each case 
required approximately 0.1 sec per iteration 


Kib)y^PDib)y, (5-31) 


Eq. 5-31 is now in the form of Eq. 5-6, with 
proper interpretation of notation. 

In order to implement the computational 
algorithm of par. 5-3, the following vectors 
are required: 

T bJ 


TABLE 5-1 

COMPARISON OF UNIFORM 
AND OPTIMAL COLUMNS 


P. lb 

Volume of 
Optimel 
Column, in? 

Volume of 
Uniform* 
Column, in? 

Saving*, % 

500 

0.806 

0.923 

12.7 

1000 

1.143 

1.300 

12.1 

1500 

1.411 

1.600 

11B 

2000 

1.640 

1.840 

10.9 

4000 

2.412 

2.600 

7.2 


*Li^t8St uniform column which will support load P. 
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TABLE 5-2 

CROSS-SECTIONAL AREAS OF OPTIMUM COLUMNS 


Element 

No. i 

P-500 lb 

P - 1000 lb 

P - 1500 lb 

P-2000 lb 

P-4000 lb 

1 

0.1070 

0.1499 

0.1833 

0.2106 

02947 

2 

0.1055 

0.1480 

0.1809 

0.2076 

0.2875 

3 

0.1035 

0.1442 

0.1763 

0.2023 

0.2789 

4 

0.1000 

0.1383 

0.1601 

0.1942 

02683 

5 

0.0960 

0.1303 

0.1593 

0.1831 

02505 

6 

0.0631 

0.1198 

0.1464 

0.1683 

0.2302 

7 

0.0738 

0.1064 

0.1299 

0.1493 

0.2020 

8 

0.0623 

0.0892 

0.1088 

0.1250 

0.2000 

9 

0.0477 

0.0668 

0.0612 

0.1000 

0.2000 

10 

0.0267 

0.0500 

0.0750 

0.1000 

0.2000 




Figure 5-3, Profiles of Optima! Columns' 

and 15 iterations to converge on an IBM 
360-65. 

54.2 A MINIMUM WEIGHT VIBRATING 
BEAM 

A beam is to be designed by piecing 
5-14 


together uniform sections of beams as shown 
in Fig. 5-4. The objective is to choose the 
sections so that the beam is as light in weight 
as possible and still satisfies constraints on 
strength and natural frequency. Due to dy¬ 
namic inputs to the beam, it is required that 
the natural frequency of the beam be above a 
given limit co© to prevent oscillation prob¬ 
lems. 

As in the preceding column design prob¬ 
lem, the cross-sectional geometry is chosen, 
but all dimensions of the cross section may be 
varied in the same proportion. Thus, if 
denotes the area of the ith section, then the 
second moment of the cross-sectional area is 

/,. = ab] (5-35) 

where a is a constant of proportionality 
depending on the geometry of the cross 
section. The problem at hand is to minimize 



Figure 5-4. Stepped Beam 
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weight, so the cost function is 

/ = p 2 b.L. (5-36) 

/»! ' ' 

where p is material density and is the 
length of the /th section. 

As a strength constraint, it is required that 

0/ = ^o-^<O, (5-37) 

where > 0 'v& chosen so that the beam will 
support a lateral load. The constraint on 
natural frequency can be written as 

= (Oo - < 0- (5-38) 


12/, 


-12/, 

-6/,i, 

-6/,L, 

4/,i, 

6/,L, 

2I,L] 

-12/, 

6/,i, 

12/, 

61,L, 

-6/,L, 

11,Lf 

6/,I, 

ai,l] 


(5^0) 

Forming a single vector y that contains all 
displacements and rotations for the beam, the 
total kinetic and potential energies are 
y^M(b)yl2 and y^K{b)yl2, respectively. 
Lagrange^s equations, Eq. B-17, arc then 

M(b)HO-^K{b)yit)^0 (5^1) 


By neglecting compression of the beam, 
deformation of a typical element is shown in 
Fig. 5-5. By Appendix B, the kinetic energy 



Figure 5-5. Typical Element 


of an element \su^M^{b)u^l2^ where, from Eq. 
B-6 

156 -22L^ 54 \3l' 

-22Lf 4£y - 13L^ -3L^ 

pbfLf 

54 -131, 156 22£, 

13£, -3£/ 22£, 41* 

(5-39) 

Likewise, the potential energy of the ith 
element is uf^K^(byu^l2, where, from Eq. B-4 


For harmonic motion of the structure, yit) = 
y sin cor, where y is just a constant vector, t is 
time, and co is natural frequency. Substituting 
into Eq. 5-41 and defining f = co^, the 
eigenvalue equation is 

Kib)y^mb)y {5A2) 

The problem of minimizing J of Eq. 5-36, 
subject to the constraints of Eqs. 5-37 and 
5-38, and with state Eq. 5-42, is in the form 
of the general problem of par. 5-3. The 
steepest-descent computational algorithm of 
that paragraph can be applied directly to this 
problem. 

As a numerical example, the beam problem 
was solved with the data £ = 3x10*' psi, L = 
10 in., a = 1.0, and p = 0.00208 lb-sec*/in?. 
The computational algorithm required about 
0.6 sec per iteration on an IBM 360-65 system 
and approximately 15 iterations to converge. 
Results for a range of natural frequencies are 
given in Table 5-3 and the profile of an 
optimum beam is shown in Fig. 5-6. 
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TABLE 5-3 

COMPARISON OF OPTIMUM BEAMS 


Volume of Optimum 
Frequency, Uniform Volume, Meterial 

red/tec Beam*, in? in? SavinQi,% 


3600 

0.935 

0B97 

4.06 

4000 

1.155 

1.062 

8.05 

4400 

1.397 

1.259 

9.74 

4800 

1.663 

1.481 

10.94 

5200 

1.951 

1.727 

11.48 

5600 

2.263 

1.993 

11.93 

6000 

2.598 

2.283 

12.12 

10000 

7.217 

6.330 

12.29 


* Uniform beam of lowest volume having required 
natural frequency. 


/ 




Figure &6. Prof He of Optimum Beam 


B-4J3 A MINIMUM WEIGHT PORTAL 
FRAME WITH A NATURAL FRE¬ 
QUENCY CONSTRAINT 


A portal frame as shown in Fig. 5-7 is to be 
proportioned so that it weighs as little as 
possible and has its fundamental frequency at 
least as large as a specified frequency cJq. 
Each member of the planar frame is formed 
from several uniform sections whose areas are 
to be determined as design variables. As in the 
preceding problems, the cross-sectional geom¬ 
etry is taken as fixed and all dimensions of 
cross sections varied proportionally. The 
second moment of the cross-sectional area 
about a centroidal axis is = otbj where is 
the cross-sectional area of the ith element. 

Neglecting strain energy due to axial defor¬ 
mation of the horizontal member, the ele- 

5-16 


ment stiffness matrix from Appendix B is 


EabJ 



12 

- 6L, 

- 12 


- 6Lf 

4LJ 

6Lf 

2L] 

-12 

6L, 

12 

6Lf 

- 6Lf 

2L] 

6Lf 

-1 




(5-43) 


where is the length of the ith member and 
the element deformation variables are shown 
in Fig. 5-8. The potential energy PE of the ith 
element is 



Figure &B Typical Eiements 
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PE,=^u‘^K(bf)u‘ (5-44) 

where = [Wj, Wj. W3, w^]^. 

likewise, from Appendix B the kinetic 
energy KE of the typical element is 

KEf 

where u denotes time derivative of u and 

156 -22L^ 54 131^ 

-22L^ AL] -\^L^ -3L] 

54 - 13L^ 156 22L^ 

\^L^ —3Lf 22L^ 4Lf 

(5-45) 

Taking into account the lateral rigid body 
motion of Member 2, the total kinetic energy 
of the structure is 

KE='Z^u‘^M(b,)u‘+ (5-46) 

i 2 ' 2 ^ 

where M is the mass of Member 2 and is 
the horizontal velocity of point A. 

Requiring harmonic motion with frequency 
(o, the displacement vector y(t) made up of 
all displacements is 

y(t) = y sin cj/ 

where y is a constant vector. Applying 
Lagrange’s equations and eliminating time 
dependence yields 

K(b)y^mb)y (5-47) 


where y is the vector of all displacements and 
rotation, and f = co^. The matrices K(b) and 
M{b) are formed from element stiffness and 
mass matrices as outlined in Appendix B. 

Eq. 5-47 is in exactly the form of Eq. 5-16 
and the matrix for this problem is simply 
weight of the structure which is 

k 

/ = p 2 bfL, (5-48) 

where p is density of the structural material. 

The constraints imposed on the problem 
include lower limits on cross-sectional area 

0, k (5-49) 

where > 0 and a lower limit on natural 
frequency 

where is the lowest aUowable eigenvalue of 
Eq.5-47,fo=wJ. 

The steepest-descent algorithm may now be 
applied directly. Data for the specific prob¬ 
lems solved are given in Table 5-4. The results 
for an aluminum portal frame are given in 
Tables 5-5 and 5-6, with a typical profile 
shown in Fig. 5-9. The design variable b^ 
shows the distribution of material for a 
minimum weight frame whose frequency of 
vibration must be greater than or equal to a 

TABLE 54 

MATERIAL PROPERTIES FOR ALUMINUM 

a, dimensionless 0.07958 

p, lb-sec*/in * 2.616x10* 

£. Ib/in.* 10.3x10* 

/(,, in.* 0.000825 

L. in. 10.0 
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TABLE 5-5 

COMPARISON OF UNIFORM AND OPTIMAL 
FRAMES FOR ALUMINUM 


Frequency, 

red/tec 

Weight of 
Uniform 
Frame, lb 

Weight of 
Optimal 
Frama, lb 

Weight 
Reduction, % 

2000 

3.748 

1.729 

53.9 

3000 

8.434 

2.562 

69.6 

4000 

14.994 

3.590 

76.1 

5000 

23.428 

4.688 

80.0 


TABLE 5-6 

OPTIMAL DESIGN VARIABLE b| 

FOR VIBRATING FRAME 


2000 

CO 

3000 

, rad/tec 
4000 

5000 

*1 

1.577 

1.964 

2.907 

4.020 

b2 

0.883 

1.604 

2.484 

3.321 

bi 

0.552 

1.416 

1.912 

2.622 

bA 

0.374 

0.866 

1.290 

1.725 

bi 

0.350 

0.360 

0.671 

0.836 

bA 

0.350 

0.350 

0.350 

0.350 



Figure 5’9. Optimum Porta! Frame for 
(a) = 3000 rad/sec 

specified value. It can be seen from Table 5-5 
that a significant material saving is possible in 
comparison to the portal frame with members 
of constant cross section. 
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The results for the design variables are 
the same for Members 1 and 3, and Member 2 
converges to the lower bound /q , so only the 
results for Member 1 are reported. For all 
frequencies the values for the for Membo* 2 
are equal to 0.350. 

M.4 A MINIMUM WEIGHT FRAME WITH 
MULTIPLE FAILURE CRITERIA 

To iUustrate the applicability of the 
steepest-descent method for the minimum 
weight design of structures with stress, buck¬ 
ling, and displacement constraints, an ex¬ 
ample of a statically loaded frame problem is 
presented. Fig. 5-10 shows the geometrical 



Figure & 10 Frame With Side Loading 


configuration of the frame that is considered. 
All members are assumed to be of the same 
length L. Member 1 is subjected to a lateral 
loading q(/). Member 3 has a uniform crosa- 
sectional area which is prescribed and will not 
be allowed to vary. The connections at points 
A and B are frictionless pins. 

The finite element method is used to 
obtain the elastic response of the system for a 
given set of design variables, i.e., the cross- 
sectional areas of the elements. As in the 
preceding problem, the geometry of each 
cross section is the same with all dimensions 
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of cross section varying proportionally. Thus, 
/. = where is the cross-sectional area of 
the ith element. The stiffness matrix K(bf) of 
a typical element, Fig. 5-11, can be written as 
in par. 5-4.3 with generalized displacements 
defined by 


«'= (u,, Mj. Uj, u^l^. 



Figure 5- / /. Typical Elements 


From the fundamental beam theory, if R is 
the horizontal force transmitted from the 
Member 1 to 3, and assuming that Member 2 
remains straight without buckling, then 
neglecting compression of Member 2, the 
deflection at >4 is = RL^I{3EI^). From 
the equilibrium conditions on the transverse 
forces and moments at the nodes of Member 
1 , the generalized displacement z, which is 
made up of the element displacements can 
be evaluated from the following matrix equa¬ 
tion 

A{b)z^F (5-51) 

where F is a vector load and Aib) is a 
symmetric matrix. In a similar manner, if y is 
the displacement vector containing all ele¬ 
ment deflections associated with Member 2, 
the buckling load P can be determined by 
solving the eigenvalue problem 

K{b)y^PDib)y (5-52) 


where the matrix D is derived from the 
shorteningof Member2 as in par. 5-4.1 and, as 
in the previous problem, K(b) is a stiffness 
matrix. 

The cost function to be minimized in this 
problem is the structural weight of Members 1 
and 2 which is simply 

where y is the weight density of the material. 

The weight of the frame is to be minimized 
subject to the following constraints: 

1. Stress constraints at the ith node of 
Member 1: 

where = Mc(bf)lll(bf)] is bending stress, 
cibf) = is half the depth of the beam 

at point I, / = 1,2,. . ., m and is the 

maximum allowable stress. The parameter 0 is 
a property of the cross-sectional geometry. 

2. Deflection constraint: 

where is the horizontal deflection at the 
top of Member 1 and A is the maximum 
allowable lateral deflection of the top of the 
frame. 

3. Buckling constraint: 

3EI^ 

K*2 = -[r •^A ( 5 - 55 ) 

where the first term is just the load R carried 
by the column. 
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4. Compressive stress constraint at the /th 
node of Member 2: 


P 

■«•/••• 2 “ ~ ^max 2 ^ ^ 


(5-56) 


for in Member 2. 


The steepest-descent algorithm can now be 
applied directly. Given data are: A = 4 in.; 
^max, = 85,000 psi; = 5,000 psi; 

cross-sectional area of Member 3 is 4 in?; £ = 
100 in.; E = 3x10^ psi, a = 0.07958, and = 
0.5642. The resulting horizontal forces/? that 
correspond to increasing constant lateral loads 
q, given in Table 5-7, are 285 lb, 409 lb, 458 
lb, and 458 lb, respectively. For q - 20 and 
25 Ib/in., the displacement in Eq. 5-54 is an 
equality. For lower loads it is a strict in¬ 
equality. This was determined automaticaUy 
by the algorithm. The results for different 
side loadings are given in Tables 5-7 and 5-8. 
A profUe of an optimal frame is shown in Fig. 
5-12. Computation in each case required 
approximately 0.5 sec per iteration and 15 
iterations to converge on an IBM 360-65. 

TABLE 5-7 

OPTIMAL DESIGN VARIABLE b- FOR 
STATIC FRAME 

CroM-S«ctional Atm b|, in.’ 


Ekimnt 
No. i 

10 

q, Ib/in. 

15 20 

25 

0 

ID 

0 

Ib/in. 

20 

25 

1 

1.49 

2.03 

2.76 

4.51 

0.226 0.271 

0.286 

0.286 

2 

0.86 

1.20 

1.76 

3.34 

0.366 0.438 

0.464 

0.464 

3 

0.40 

0.47 

0.48 

2.10 

0.407 0.487 

0.516 

0.516 

4 

0.43 

0.53 

0.37 

0.43 

0.366 0.438 

0.464 

0.464 

5 

0.43 

0.53 

0.52 

0.43 

0.226 0.271 

0.286 

0.286 


TABLE 5-8 

VOLUME OF OPTIMUM FRAME 


q, Ib/in. 

10 15 20 25 


Optimum 

Volume, 504.2 533.4 558.0 656.0 

in. 3 _ 



Figure & 12. Profile of Optima! Frame 
With Multiple Failure 
Criteria (q = 25 ibAn.) 


54.5 A MINIMUM WEIGHT PLATE WITH 
FREQUENCY CONSTRAINTS 

As a final numerical example in this para¬ 
graph consider the problem of minimum 
weight design of the simply supported rectan¬ 
gular plate shown in Fig. 5-13 subject to a 
natural frequency constraint. The bending 
equation for plates of variable thickness is 
given in Eq. 5-58. When the deflection 
W{x,yj) is written in the form 

W(,x, y, t) = >v(jc, y) cos cot (5-57) 

the governing equation becomes 



Figure 5-13. Rectangular Plate 
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D^*w + 2 — — 7*w + 2-— — V^w 
oy ox oy oy 


+ 7i£>.7JH--(l -v) 


1 2 3 4 5 



- 2 


d^D a^w a^g a^w 
a^a^ a^aj' a^'* ax* 


= A pw* w 


(5-58) 


where 


Figure 5-14. Collocation Points 


D(x.y) = 


Eh^ix,y) 
12(1 -»»*) 


(5-59) jjj jjjg steepest-descent algorithm, the cost 

function that is to be minimized is 


h(x,y) is the thickness of the plate, which is 
the design variable, and p is the density of 
plate material. 

When the function w(jcj') is represented in 
the form 


J = pAA L hix.,y.) (5-61) 

1 ^ 

where AA is the area of the grid squares. 

The constraints imposed on the design are 


w(jc.y) = 2 

m ,««1 


. rriKX 

sm- 

a 



ho - hix^,y^) < 0 


(5-62) 


(5-60) and 


the eigenvalue problem can be solved approxi¬ 
mately by numerical methods. The problem 
posed here is solved using a collocation 
technique, i.e., the differential equation is 
satisfied at discrete points in the region. Fig. 
5-14. 

The number of discrete points is chosen 
equal to the number of terms in the truncated 
series of Eq. 5-60. The drivatives of the 
function D{x,y) at the grid points are evalu¬ 
ated by the use of finite differences. For a 
given set of design variables, i.e., h(x,y), the 
lowest eigenvalue, { = pco^, and the associated 
eigenvector {>4^^ | , which plays the role of 
y, are determined. 


io-i<0 (5-63) 

where Hq > 0 and i:o > 0 are lower limits on 
plate thickness and eigenvalue of Eq. 5-58, 
respectively. 

The steepest-descent computational al¬ 
gorithm applies in a direct way. It should be 
noted that the collocation method for ap¬ 
proximate solutions of the equations for 
natural frequency yields nonsymmetric 
matrices K and M in Eq. 5-16. Thus in this 
formulation of the plate optimization prob¬ 
lem, the adjoint eigenvalue problem, Eq. 5-18, 
must be solved along with the original eigen¬ 
value problem. If finite element methods for 
plate analysis had been used, symmetric 
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matrices would have been obtained. In this 
example, as well as in the preceding, a 
minimum effort was expended to make com¬ 
putations efficient. The emphasis has been 
placed on getting results. A subsequent effort 
will be devoted to making algorithms more 
efficient. 

The minimum weight plate problem was 
solved by the algorithm of par. 5-3 with 
E = 3.0x10*^ psi, p = 7.43 X 10-^ lb-secVin.^ , 
I'= 0.30, and o)© = 1375 rad/sec. The uni¬ 
form plate with f = pcj^ = 1400 was taken as 
the initial estimate to the optimization prob¬ 
lem. The dimensions of the plate are 10.0 in. 
by 10.0 in. and the value of /iq = 0.1 in. The 
material is assumed to have a constant density 
and so minimum weight is equivalent to 
minimum volume. The volume of the uniform 
plate is 11.44 in.^ and the volume of the 
optimal plate is 10.8 in.^ which is a 5.6% 
material savings. Fig. 5-15 shows 25 colloca¬ 
tion points. The numbers in the network are 
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Figure 5-15. Optima! Design Variabie h(x, y) 
for Vibrating Piate 

the values of the thickness function hix,y) at 
each nodal point which is located at the 
center of each square. Double symmetry of 
the optimal plate thickness was observed 
about axes through the point (a/2, b/2). 


*This paragraph is based on the dissertation of Dr. J. Arora, 
Ref. 34. 
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5-5 GENERAL TREATMENT OF TRUSS 
DESIGN* 

The theory presented in pars. 5-2 and 5-3 
will now be applied to the case of general 
plane and space trusses. These types of 
structures are encountered quite frequently in 
practical situations. Most common among 
these are buildings, transmission towers, 
bridges, cooling towers, aircraft structures, 
and lightweight military structures. In all these 
cases, it is desirable that the structure simul¬ 
taneously should meet strength, deflection, 
and frequency requirements and be of mini¬ 
mum weight. In this chapter, all these con¬ 
straints will be considered. 

5-5.1 SPECIAL PROBLEM FORMULATION 

In the problems to be considered here, 
geometry of the truss is assumed to be 
specified and the loads are applied only at the 
joints. The objective function for the problem 
is taken as the total weight or the volume of 
the truss, and the design variable for each 
member is taken as its cross-sectional area. 
The objective function of Eq. 5-1 in this case 
is a linear function of m design variables and 
may be written as 

J = p^L^b^ (5-64) 

where p^. and are material density and 
length of member /, respectively. 

The displacement method of structural 
analysis is used, and nodal displacements of 
the truss are considered as basic state vari¬ 
ables. Therefore, the /th component of the 
state variable represents the /th displacement 
component of the truss. Fig. 5-16 shows a 
simple scheme of designating joints, members, 
and displacement components of a truss. Fig. 
5-17 shows a bar element with sign convcn- 
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Figure 5-17, A Truss Element 

tion to be used on element forces and 
deformations. Basic equations of the displace¬ 
ment method for a truss may be written as 


u=pz 
F= K(b)u 
and 


(5-65) 

(5-66) 

(5-67) 


where u is the element deformation vector, F 
is the element force vector, / is the vector of 
external loads applied to structural nodes, and 
0 is a rectangular transformation matrix, 
which transforms the nodal displacement vec¬ 
tor z to the element deformation vector u. 
The matrix K(b) is composed of element 
stiffness matrices and is given by 


K{b) = 


“m_J 


(5-68) 


where m is the total number of elements in 
the truss and is the stiffness matrix for the 
ith element of the truss. The stiffness matrix 
for the /th element may be written as 






(5-69) 


where is Young’s Modulus of Elasticity of 
the 2 th element. Substituting Eqs. 5-65 and 
5-66 into Eq. 5-67, one obtains 


/= lfi^K(b)p]z 
= Kib)z 
where 

K{b) = (i^K{b)& 


(5-70) 


(5-71) 


is the structure stiffness matrix, which is 
identical to A{b) in Eq. 5-2. The mass matrix 
M(b) for the truss may also be computed in a 
similar way, and it is given by 


M{b) = fMib)§ 


(5-72) 


where M(b) is formed from element mass 
matrices and is given by 


M{b) = 




M. 


(5-73) 


Here, is the mass matrix for the ith 
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element of the truss and is given by 


W, [2 l] 

' 6 Ll 2 J. 


(5-74) 


Any nonstmctural mass that is attached to 
the truss may also be added to the mass 
matrix of Eq. 5-72 and it may be written as 




(5-75) 


where is a matrix consisting of nonstruc- 
tural masses. In the example problems to be 
presented later, is taken to be a null 
matrix. However, there is no particular dif¬ 
ficulty in incorporating this matrix if it is not 
zero. Its inclusion will simply change the 
lowest natural frequency of the truss. The 
derivation of the given structural analysis 
equations and matrices is well documented in 
the literature (Refs. 20, 24). 

In order to apply the algorithm of par. 
5-3.2, two main matrices and of Eqs. 
5-20 and 5-21 must be computed. They can 
be assembled very easily once various other 
matrices required in them have been com¬ 
puted. In the class of problems treated here 
fijb) does not depend on the design variable, 
so bf[b)ldb = 0. Also, one obtains from Eq. 
5-64 


bb .’ ^ 


bJ 

— =( 0 , , 0 ) 

bz 


and d//df = 0; and from Eq. 5-14 

= (0,., 0)^. Substitution of these values 

into Eq. 5-20 yields 


«-' = (P.^..Pm ^m)"" 


(5-76) 


Assembling the matrix of Eq. 5-21 is 
more tedious. It requires a formation of 
constraint set ^ as in Eq. 4-105 which will be 
discussed in detail now. The constraint set ^ 
may be divided into five subsets, namely, 
frequency, stress, buckling and displacement 
constraints, and lower and upper boimds on 
the design variables. Explanation of these 
subsets follows one by one and, for each 
subset, matrices b^/bq, and b^/bb are 

computed. 

5-5.1.1 FREQUENCY CONSTRAINTS 

In the example problems, only one fre¬ 
quency constraint is considered. However, if 
other frequency constraints are also present, 
these may be treated in a similar way. Since 
the design variable vector b is available at any 
iteration, the matrices Kib) and M(b) are 
computed from Eqs. 5-71 and 5-72, respec¬ 
tively. The lowest eigenvalue f and the asso¬ 
ciated eigenvector y are then obtained from 
Eqs. 5-5 and 5-6, respectively. Premultiplying 
both sides of Eq. 5-6 by A'^( 6 ), one obtains 

K -1 (bmb)y =jy^yy (5-77) 

where 7 = 1/f. The power method is used to 
find the largest eigenvalue 7 of K'^ibyMib). 
This method of obtaining the largest eigen¬ 
value is quite efficient in the present problem, 
since a very good approximation to the 
eigenvector at each iteration, except for the 
first one, is available from the previous 
iteration. The lowest eigenvalue is then given 
by f=l/ 7 . The frequency constraint may 
now be written as 

(5-78) 

where fo corresponds to a given frequency. 
In terms of the notation used in Eq. 


Digitized by L^ooQle 







AMCP 706-192 


5-4, Eq. 5-78 may be written as 


= - 0 ( 5 - 79 ) 

where 5 is a number assigned to this con¬ 
straint. If this constraint is violated, then 

H = - (fo - f)» =(0... (>). 

= ^l,and3^,/dz = (0,., 0). 

5-5.1^ STRESS CONSTRAINTS 

Since the matrix K(b) is available, Eq. 5-70 
can be solved for the unknown displacement 
vector z. Element forces can then be com¬ 
puted from Eqs. 5-65 and 5-66. Substitution 
of Eq. 5-65 into Eq. 5-66 yields 

F^Sz (5-80) 

where S = It may be noticed from Fig. 
5-17 that two forces are specified for each 
element but the primary force remains con¬ 
stant throughout a bar element. Therefore, 
FJ = -F{, where superscript i denotes the 
element number. Dimensions of the matrix S 
may be reduced from 2m Xn to mXn by 
using this relationship. Stresses in the mem¬ 
bers may now be calculated as 



Once the stress for each member becomes 
known, it is checked against the critical stress. 
A number of these stresses may be violated in 
a particular iteration. The stress constraint for 
the ith member may be written as 

(5-82) 

where is the critical stress for member i. It 
should be noted that, in terms of the nota¬ 
tions used in Fig. 5-17, compressive stress in a 
member is taken as positive and accordingly 


inequality (Eq. 5-82) holds in such cases. For 
tension members, and are negative; 
therefore, Eq. 5-82 is written as oj < a^. The 
expressions that follow are written for the 
case of compression members. For tension 
members similar expressions can be readily 
written. Inequality Eq. 5-82 may be written 
as 


= 0 (5-83) 

where 5 is an index assigned to this constraint. 
In all subsequent constraint subsets, subscript 
5 on ^ will have the same meaning. If Eq. 5-83 
is violated, then = - (a^ - o^), = 0, 



where daJbZj and bF^bb^ may be computed 
from Eq. 5-80. 


&-5.1.3 BUCKLING CONSTRAINTS 

Each compression member of the truss is 
also checked for the Euler buckling load given 
by 






(5-84) 


where and are the critical buckling load 
and moment of inertia of the ith member, 
respectively. It is assumed that the moment of 
inertia of the cross section of a member can 
be written as 

I, = (5-85) 

where is a constant depending upon the 

5-25 


Digitized by 






AMCP 706-192 


cross-sectional geometry of the /th member. 
This is a convenient way of expressing the 
moment of inertia in terms of the cross-sec¬ 
tional area of a member, because the constant 
o^^ can be specified by the designer quite 
readily. Therefore, Eq. 5-84 may now be 
written as 




IT* 




(5-86) 


where z® is the maximum allowable /th 
component of displacement. If a particular 
component of displacement is positive, then 
Eq. 5-89 is written as z^ — z" < 0; and if it is 
negative, then it is written asz"- z^ < 0. The 
expressions that follow are written for the 
case of positive displacement, and similar 
expressions can also be written for the case of 
negative displacement. In terms of the nota¬ 
tion of Eq. 5-4 the constraint for the positive 
displacement may be written as 


where 0^ = K^EftjL]. Eq. 5-86 may be writ¬ 
ten in terms of the critical buckling stress oj 
as 

af =0^6,.. (5-87) 


= < 0 ( 5 - 90 ) 

If the /th displacement component exceeds an 
allowable limit, then 


Now the buckling constraint for the /th 
compression member may be written as 

0,(^. z, i) = Of - < 0. (5-88) 


A0, 


and 


. 

5r =<».lift-"'- 


If this buckling constraint is violated, then 
= - (o, - o*,), 90,/3f = 0, 


d0 d0 1 d/^. 1 /> 



where bF^Ibb^ and bOfldZj may again be 
computed from Eq. 5-80. The buckling con¬ 
straints on all other compression members are 
treated in a similar way. 


5-5.1.4 DISPLACEMENT CONSTRAINTS 


All the displacement components are checked 
and any other violation is treated in a similar 
way. 

5-5.1.5 BOUNDS ON DESIGN VARIABLES 

It may be necessary to put upper and lower 
bounds on each design variable. This con¬ 
straint may be demanded by many practical, 
architectural or structural considerations. 
Moreover, a lower limit on each design vari¬ 
able is required in the algorithm in order to 
avoid the attainment of unrealizable designs 
such as negative areas. This constraint may be 
written as 


The displacement components are known 
at this stage; therefore, the constraints on 
them may be written as 

1 z^ I < z^ (5-89) 


c ft, < fty (5-91) 

where is the lower and b^ is the upper 
bound on the /th design variable. Inequality 
Eq. 5-91 may be split up into two parts: 
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(1) Lower Bound on Design Variables: 

This constraint is written as or in 

terms of notation of Eq. 5-4 

= < 0 (5-92) 


Violation of this constraint yields, 
90 

= -b,), = 0 . 


db 


= (0,.... 0, - 1,0.0),and 

(/Ih) 


dz 


= ( 0 , 


., 0 ). 


Problem par. 4-3.1, the present formulation 
handles it without any difficulty. 

After all the constraints have been con¬ 
sidered, the matrices 30/3ft, 90/9f, and 
30/3z are available and X5 can be solved 
from Eq. 5-15. This still does not allow 
the matrix of Eq. 5-21 to be assem¬ 
bled. The following matrices must also be 
computed 


|-[A:(ft)2l 

Ob 

(5-94) 

Ob 

(5-95) 


(2) Upper Bound on Design Variables: 

This constraint is very similar to the previ¬ 
ous one and in the notation of Eq. 5-4 it is 
written as 

(*, z.i) = bi-b^ <0 (5-93) 

If the upper bound on any design variable is 
violated, then 


A0, = -(6,-fry) 



= 0 , 


db 


= (0,..0, 1, 0,..0), and 

(/th) 


dz 


= ( 0 ,..., 0 ). 


It may be noticed here that the cross-sectional 
area of any member of the truss may be 
assigned a predetermined value by putting the 
same upper and lower bound on it. This 
situation may be encountered in practice due 
to various reasons, and as shown in Example 


and 

^ lM(b)y] . (5-96) 

These matrices are assembled automatically 
from the quantities such as K{b), M{b), z, and 
y, which have previously been calculated in 
the computer. The procedure of computing 
the matrix of Eq. 5-94 will be explained here; 
the matrices of Eqs. 5-95 and 5-96 are 
calculated in an exactly similar manner. Eq. 
5-71 may be written as (see Appendix B): 


K{b)= 2 

/*1 * 

where K. is the only quantity which is a 
function of b. Now, Eq. 5-94 can be written 
as follows, by substituting the above expres¬ 
sion for K(b)\ 
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It should be noted here that the summation 
sign in Eq. 5-97 represents the summation of 
m matrices of dimension (n x m). The 
quantity inside the differentiation sign is an 
n-dimensional vector whose components may 
be dependent upon the design variable vector 
b. Therefore, the quantity inside the summa¬ 
tion sign is an n X m matrix for each index /. 
However, in the present case, since is a 
function of only b., the computation of Eq. 

5- 97 is greatly simplified. Consideration of 
each / in Eq. 5-97 generates a (n x m) matrix 
whose only nonzero elements are in its ith 
column. Computation of Eq. 5-97 is per¬ 
formed quite readily and automatically in the 
computer. 

All the information being available, the 
matrix of Eq. 5-21 may now be assembled 
and the algorithm of par. 5-3.2 may be used 
to solve actual problems. 

6- 5.2 MULTIPLE LOADING CONDITIONS 

Most structures are designed to withstand a 
multiple loading environment. This is quite 
reasonable, because only a certain combina¬ 
tion of loads may act on the structure at a 
particular time. This situation is handled in 
the par. 5-5.1 formulation by expanding the 
state variable vector z to include all states. 
The element force vector / is also expanded 
accordingly. Formulation of displacement, 
stress, and buckling constraints must also take 
into consideration all states of the system. 
This is handled in the manner that follows. 
While formulating a particular displacement 
constraint, the value of displacement for each 
loading case is checked and each violation is 
entered into the reduced constraint vector 0. 
After this, the procedure of calculating the 
matrices 30/3b, 30/3f, and 30/3z is the same 
as explained earlier. An exact same procedure 
is followed in treating stress and buckling 
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constraints. This procedure of taking into 
consideration all the loading conditions has 
worked out quite satisfactorily in the example 
problems. 

5-5.3 EXAMPLE PROBLEMS 

Several trusses are designed by applying the 
procedure presented in this paragraph. A 
computer program, based on the algorithm 
stated previously, was written in FORTRAN 
IV. The computations were performed on the 
University of Iowa IBM 360-65 computer. 
The stiffness matrix for the structure was 
inverted by the Gauss-Jordan elimination 
procedure, and the power method was used to 
find the smallest eigenvalue. 

Results for three typical trusses are pre¬ 
sented here. All these structures were de¬ 
signed with stress, displacement, buckling, 
and frequency constraints. Examples 4-1 and 
4-2, par. 4-1.1, are compared with results in 
Ref. 25. These were designed with and with¬ 
out frequency and buckling constraints in 
order to compare the results with Ref. 25. 
Example 4-3, par. 4-3.1, is treated in Ref. 26, 
and it was also designed with only stress 
constraints in one case to compare results 
with Ref. 26. All sample problems had lower 
limits on areas of the elements and Example 
4-3 had upper limits. The program is general 
enough to handle different lower and upper 
bounds on stresses in an element, elements of 
different materials, and a different buckling 
parameter for each element. The examples 
follow: 

J. Example 5-1. Five-node Four-bar Truss 

Fig. 5-18 shows the geometry and the 
dimensions of the truss. Input and output 
information is given in Table 5-9. In order to 
compare the results with those of Ref. 25, the 
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Figure & 18. Four-bar Truss (Example & 1) 

truss was first designed for stress constraints, 
Fig. 5-19(A), and second for stress and 
displacement constraints. Fig. S-19(B). It 
may be noted that the results presented here 
are at least as good as those presented in Ref. 
25. 

The final design weight with only stress 
constraints was 9.09 lb and computation time 
was 1.820 sec for 12 iterations. The final 
design weight reported in Ref. 25 was 9.09 lb 
with a computation time of 4 sec for 5 cycles. 
The final design weight, with stress and 
displacement constraints, was 14.28 lb and 
the computation time was 1.500 sec for 12 
iterations. The final design weight reported in 
Ref. 25 was 14.30 lb with a computation time 
of 10 sec for 4 cycles. It is difficult to make 
an exact comparison of the computation 
times because the computer used here is 
different from that used in Ref. 25. The 
computation times reported in Ref. 25 are on 
IBM 7094-11-7044 DCS Computer. 

The truss was also designed by including 
buckling and frequency constraints along with 
other constraints. Two different starting 
points were used in optimizing this truss. 
Starting Point 1 was infeasible and Starting 
Point 2 was feasible. The final design weight 
beginning at Starting Point 1 was 113.48 lb 
and at Starting Point 2 was 113.77 lb. The 
slight difference in the two weights was due 


to the fact that in the first case the frequency 
constraint was violated by 0.154%, whereas in 
the second case this violation was 0.143%. 
Fig. 5-19 shows variation of the cost function 
with respect to the number of iterations for 
this problem. It may be noted that for 
practical purposes, convergence was obtained 
in six to eight iterations for all the cases. 

2. Example 5-2. Transmission Tower 

Fig. 5-20 shows the geometry and the 
dimensions of the tower. This example also is 
treated in Ref. 25. In this problem, the 
cross-sectional area of each member of the 




Figure 5-19. Iteration vs Weight Curves for 
Example & 1, Four-bar Truss 
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TABLE 5-9 

FOUR-BAR TRUSS (EXAMPLE 5-1) 

Design Information: For each member. Young's Modulus of Elasticity Ej, the specific weight lower limit on 
aree of cross section bh, and the constant are 10^ kips/in.^, 0.10 Ib/in.^, 0.10 in.^, and 1, respectively. There is 
no upper limit on member size. The resonant frequency for the truss is 284.6 Hz. For Output 1, the stress limits on 
eech member are ±25.0 kips/in.^ and the displacement limits at node five are, 0.0, ± 0.3 in., and ± 0.4 in. in thexi 
X 2 -» and X 3 -directions, respectively. For Output 2, the stress limits for eech member are ± 15.0 kips/in.^, and the 
displacement limits at node five are 0.15 in. in all three directions. There are three loading conditions for the truss; 
they are: in positive Xj-,x 2 and X 3 - directions, 5, 0, 0; 0,5, 0; and 0,0, 7.5 kip, respectively, applied at node five. 


OUTPUT 1. IV/f/i Stress and Displacement Constraints Only 


With only stress constraints 

Time per iteration - 0.152 tec 

Total time - 1.820 tec 

With displacament constraintt, alto 

Time per iteration - 0.124 sec 

Total time ■ 1.500 sec 

El. 

Starting 

Final 

El. 

Starting 

Final 

No. 

Vahias, 

Values, 

No. 

Values, 

Values, 


in.* 

in.* 


in.* 

in.* 

1 

0.100 

0.130 

1 

0.500 

0.234 

2 

0.200 

0.192 

2 

0.500 

0.319 

3 

0.200 

0.120 

3 

0.500 

0.184 

4 

0.100 

0.100 

4 

0.500 

0.128 

Weight, 

10.19 

9.09 

Weight, 

34.86 

14.28 

lb 



lb 




OUTPUT 2. I^ith AH Constraints 


Starting Point 1 

Time per iteration * 0.147 sec 

Total time ■ 4.710 sec 

Starting Point 2 

Time per iteration - 0.172 sec 

Total time - 4.640 sec 

El. 

Starting 

Final 

El. 

Starting 

Final 

No. 

Values, 

Values, 

No. 

Values, 

Values, 


in.* 

in.* 


in.* 

in.* 

1 

1.000 

0.543 

1 

2.000 

0.559 

2 

1.000 

1.961 

2 

4.000 

1.883 

3 

1.000 

3.635 

3 

8.000 

3.703 

4 

1.000 

0.479 

4 

1.000 

0.468 

Weight, 

69.72 

113.48 

Weight, 

257.68 

113.77 

lb 



lb 
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Figure 5-20. Transmission Tower (Example 5-2) 


truss is treated as an unknown design variable, cycles. The values of final design variables 

and the results obtained are given in Table compare quite well with those in Ref. 25. At 

5-10. The tower was designed first with only the final design point all constraints were 

stress constraints. The final design weight in satisfied within 0.006%. 

this case was 91.13 lb with a computation 

time of 38 sec for 12 iterations. The final The tower was also designed with stress and 

design weight reported in Ref. 25 was 91.14 displacement constraints and, finally, with all 

lb with a computation time of 9 sec for 5 the constraints included. The design weight in 
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TABLE 5-10 

TRANSMISSION TOWER (EXAMPLE 5-2) 

Design information: For each member of the structure, the modulus of elasticity £/, the specific weight p/, the 
constant a/, and the stress limits are 10^ kips/in.^, 0.10 Ib/in.^, 1.0 and ± 40.0 kips/in.^, respectively. The lower 
limit on the area of cross section of each member is 0.10 in.^ for the case with stress constraints only and 0.01 in.^ 
for other cases. There is no upper limit on the member sizes. The resonant frequency for the structure is 173.92 Hz, 
and the displacement limits are 0.35 in. on all nodes and in all directions. There are six loading conditions, and they 
are as follows (all loads are in kips): 




Direction of Load | 



Direction of Load j 

Load 

Node 




Load 

Nodt 




Cond. 



X2 

^3 

Cond. 


^1 




1 

1.0 

10.0 



1 

0 

10.0 

-5.0 



0 

10.0 




- 1.0 

10.0 

-5.0 

1 


0.5 

0 

mgm 

2 


-0.5 

0 

0 



0.5 

0 

■■ 



-0.5 

0 

0 


1 

1.0 

-10.0 

-5.0 


1 

0 

- 10.0 



2 

0 

-10.0 

-5.0 


2 

- 1.0 

-10.0 


3 

3 

0.5 

0 

0 

4 

4 

-0.5 

0 



6 

0.5 

0 

0 


5 

-0.5 

0 

■■ 

e 

1 

m 


-5.0 

6 

1 

0 

-20.0 

-5.0 

0 

2 

0 

-20.0 

-5.0 

CM 

0 

20.0 

-5.0 


Output: 



With Stress 

Constraints Only 

With Stress and Dis- 
placement Constraints 

With All Constraints 

El. 

No. 

Starting 
ValuM, in.’ 

Final 

Values, in.^ 

Starting 
ValMs, in.’ 

Final 

Values, in.^ 

Starting 
Values, in.^ 

Final 

Valuas. in.’ 

1 

0.200 

0.100 

1.000 

0.010 

0.500 

0.010 


0.500 


3.000 

2.322 

2.500 

2.092 


0.500 


3.000 

2.322 

2.500 

2.075 


0.500 

0.376 

3.000 

2.322 

2.500 

2.095 


0.500 

0.376 

3.000 

2.322 

2.500 

2.083 


0.500 

0.471 

3.000 

2.768 

2.500 

2.357 


0.500 

0.471 

3.000 

2.768 

2.500 

2.354 

8 

0.500 

0.471 

3.000 

2.768 

2.500 

2.350 

9 

0.500 

0.471 

3.000 

2.768 

2.500 

2.335 

10 

0.200 

0.100 

1.000 

0.010 

0.500 

0.035 

11 

0.200 

0.100 

1.000 

0.010 

0.500 

0.035 

12 

0.200 

0.100 

1.000 


0.500 

0.087 

13 

0.200 

0.100 

1.000 

0.010 

0.500 


14 

0.200 

0.100 

2.000 

0.690 

1.500 

1.113 

15 

0.200 

0.100 

2.000 

0.690 

1.500 

1.113 

16 


0.100 

2.000 

0.690 

1.500 

1.112 

17 

0.200 

0.100 

2.000 

0.690 

1.500 

1.112 
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TABLE 5-10 (Corn'd.) 


Output: (Corn'd.) 



With Stress 

Constraints Only 

With Stress and Dis- 
ptaoemant Constraints 

With All Constraints 

El. 

Starting 

Final 

Starting 

Final 

Starting 

Final 

No. 

ValuM, in.’ 

Values, in.^ 

Values, in.^ 

Values, in.^ 

Values, in.^ 

Values, in.^ 

18 

0.500 



1.524 

2.000 

2.056 

19 

0.500 



1.524 

2.000 

2.058 

20 

0.500 

0.277 


1.524 

2.000 

2.046 

21 

0.500 

0.277 


1.524 

2.000 

2.058 

22 

0.500 

0.380 


2.733 

3.000 

2.822 

23 

0.500 

0.380 

3.000 

2.733 

3.000 

2.808 

24 


0.380 

3.000 

2.733 

3.000 

2.803 

25 


0.380 


2.733 

3.000 

2.785 

Weight, 

lb 

132.37 

91.13 

772.24 

546.18 

668.80 

590.32 


the first case was 546.18 lb with a computa¬ 
tion time of 47 sec for 17 iterations, and the 
maximum constraint violation was 0.00011%. 
The comparable design weight reported in 
Ref. 25 was 555.11 lb with a computation 
time of 24 sec for 7 cycles. This shows that, 
when displacement constraints are also in¬ 
cluded, the results obtained with the new 
gradient projection method are slightly better 
than those of Ref. 25. For a design with all 
the constraints included, the final weight was 
590.32 lb with a computation time of 129 sec 
for 36 iterations, and the maximum violation 
of constraint was 0.028%. Fig. 5-21 shows 
variation of the cost function with respect to 
the iteration number for the last two cases of 
this problem. It may be noted that for 
practical purposes, convergence was obtained 
in only 6 iterations. 

3. Example 5-3. 4 7-Bar Plane Truss 

The schematic diagram of the structure 
with dimensions is shown in Fig. 5-22. This 
example is also treated in Ref. 26 where it is 


optimized for a single loading condition. The 
design information and the results are shown 
in Table 5-11. In order to compare the results 
with Ref. 26, the truss was first optimized 
with stress constraints only. The final design 
weight was 2,993.37 lb with a computation 



Figure 5-21, Iteration vs Weight Curves 
for Exanmie 5-2, Trans¬ 
mission Tower 
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Figure 5-22, 47-Bar Plane Truss (Example 5-3) 


time of 115 sec for 17 iterations. At this 
point the stress in member 18 was violated by 
0.24% and all other violations were less than 
0.035%. Another feasible design occurred at 
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9th iteration for which the design weight was 
2,998.88 lb; maximum constraint violation 
was 0.10% for stress in 7th member and all 
other violations were less than 0.016%. The 
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TABLE 5-11 

47-BAR PLANE TRUSS (EXAMPLE 5-3) 


Design Information: For each member of the structure* the modulus of elasticity E^, the specific weight and 
the constant are 3.0 x 10^ kips/in.^ * 0.284 Ib/in.^* and 1.0* respectively. The resonant frequency for the structure 
is 16.0 Hz and the displacement limits are 1 in. on all nodes and in all directions. There is one loading condition for 
the truss which is shown on Fig. 5-22. Allowable stress in tension for ail members is 21.28 kips/in.^ 


Output: 


El. 

No. 

Lower Area 
Bound* in.^ 

Upper Area 
Bound* in.^ 

Compression 
Stress Limit* 
kipi/in.^ 

Initial Area* 
in.» 

Final Area* in.^ 

With Stress 
Constraints 
Only 

With All 
Constraints 

1 

3.570 

9.620 

14.56 

5.690 

3.570 

7.537 

2 

3.570 

9.620 

14.56 

5.690 

3.570 

5.771 

3 

3.570 

9.620 

14.56 

5.690 

3.570 

3.570 

4 

3.570 

9.620 

14.56 

5.690 

3.570 

4.473 

5 

3.570 

9.620 

14.56 

5.690 

3.752 

6.505 

6 

3.570 

9.620 

14.56 

5.690 

3.570 

6.124 

7 

3.570 

9.620 

14.56 

5.690 

4.212 

7.777 

8 

3.570 

9.620 

14.56 

5.690 

5.217 

9.529 

9 

1.930 

2.940 

15.90 

2.210 

1.930 

1.930 

10 

1.930 

2.940 

15.90 

2.210 

1.930 

1.930 

11 

1.930 

2.940 

15.90 

2.210 

2.205 

2.199 

12 

1.930 

2.940 

15.90 

2.210 

2.205 

2.205 

13 

1.930 

2.940 

15.90 

2.210 

1.930 

2.940 

14 

1.930 

2.940 

15.90 

2.210 

1.930 

2.940 

15 

1.930 

2.940 

15.90 

2.210 

2.205 

2.119 

16 

1.930 

2.940 

15.90 

2.210 

2.205 

2.205 

17 

1.360 

2.190 

15.46 

2.100 

1.417 

2.136 

18 

1.360 

2.190 

15.46 

2.100 

1.815 

1.630 

19 

1.360 

2.190 

15.46 

2.100 

1.360 

1.360 

20 

1.360 

2.190 

15.46 

2.100 

1.360 

1.360 

21 

0.376 

0.376 

3.36 

0.376 

0.376 

0.376 

22 

0.376 

0.376 

3.36 

0.376 

0.376 

0.376 

23 

0.376 

0.376 

3.36 

0.376 

0.376 

0.376 

24 

0.376 

0.376 

3.36 

0.376 

0.376 

0.376 

25 

1.360 

2.190 

12.32 

2.100 

1.360 

1.455 

26 

1.360 

2.190 

12.32 

2.100 

1.360 

1.451 

27 

1.360 

2.190 

12.32 

2.100 

1.360 

2.137 

28 

1.360 

2.190 

12.32 

2.100 

1.360 

1.360 

29 

1.360 

2.190 

12.32 

2.100 

1.360 

1.492 

30 

1.360 

2.190 

12.32 

2.100 

1.360 

1.428 

31 

2.940 

6.040 

17.47 

3.850 

2.940 

3.774 

32 

2.940 

6.040 

17.47 

3.850 

2.940 

2.940 

33 

2.940 

6.040 

17.47 

3.850 

2.940 

2.940 

34 

2.940 

6.040 

17.47 

3.850 

2.940 

5.592 

35 

2.940 

6.040 

17.47 

3.850 

2.940 

3.582 

36 

2.940 

6.040 

17.47 

3.850 

2.940 

2.940 

37 

0.940 

1.320 

4.93 

1.200 

0.940 

0.940 
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TABLE 5-11 (Corn'd.) 


Output: (Coffit*d.) 


El. 

No. 

Lower Area 
Bound, in.^ 

Upper Area 
Bound, in.^ 

Comprettion 
Strati Limit, 
kipt/in.* 

Initial Araa, 
in.* 

Final Araa 

,in.* 

With Strati 
Constraints 

Only 

With All 
Constraints 

38 

0.940 

1.320 


1.200 

0.940 

0.940 

39 

0.940 

1.320 


1.200 

0.940 

0.940 

40 

2.940 

6.040 


3.500 

2.940 

2.940 

41 

2.940 

6.040 

10.75 

3.500 

2.940 

2.940 

42 

2.940 

6.040 

10.75 

3.500 

2.940 

2.940 

43 

2.940 

6.040 

10.75 

3.500 

2.940 

2.940 

44 

2.940 

6.040 

10.75 

3.500 

2.940 

2.940 

45 

2.940 

6.040 

10.75 

3.500 

2.940 

2.940 

46 

2.940 

6.040 

10.75 

3.500 

2.940 

2.940 

47 

2.940 

6.040 

10.75 

3.500 

2.940 

2.940 


Weight, lb 

3910.30 

2993.37 

3771.0 


final weight reported in Ref. 26 was 3,328.5 
lb which is considerably higher than the one 
reported herein. This may be attributed to the 
fact that in Ref. 26 the members are divided 
into eight groups so that there are only eight 
independent design variables, whereas in this 
treatment, area of cross section of each 
member of the truss is treated as an unknown 
design variable. 

The truss also was designed by imposing all 
the constraints. The starting point, stress 
limits, and upper and lower bounds on the 
areas are same as those used in Ref. 26. It 
may be noted that members 21, 22, 23, and 
24 had the same upper and lower bounds on 
areas. The final design weight was 3,771.0 lb 
with a computation time of 166 sec for 24 
iterations. The maximum violation of the 
constraint was 0.27% on stress for member 
11. Fig. 5-23 shows variation of the cost 
function with respect to the number of 
iteration for both the cases. It may be noted 

niiis paragraph b based on the dbsertation of Dr. J. Arora, 
Ref. 34. 
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that for practical purposes, convergence oc¬ 
curred in approximately 6 iterations. 



Figure 5-23. Iteration vs \Neight Curves for 

Example &3, 47-Bar Plane Truss 


5-6 A GENERAL TREATMENT OF PLANE 
FRAME DESIGN* 

In this paragraph, an application of the 
gradient projection method to framed struc- 
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tures will be presented. Rigid frames are 
found quite frequently in practical situations, 
including building and vehicle structures. In 
the present work, optimization of planar 
framed structures using wide flange steel 
sections is considered under the assumption 
of elastic, working stress analysis. The AISC 
Steel Construction Manual (Ref. 27) is used 
for the properties of these sections. The 
constraints considered are stress, buckling, 
displacement, natural frequency, and restric¬ 
tions on design variables. 

5-6.1 PROBLEM FORMULATION 

In the problems considered here, the 
geometry of the frame is assumed to be 
specified, i.e., lengths of the members or the 
joint coofdinates are not treated as design 
variables. Multiple loading conditions for the 
structure are treated by the procedure ex¬ 
plained in par. 5-4.2. The moment of inertia 
for each element is treated as the design 
variable; therefore, b is a vector whose ith 
component is the moment of inertia of the 
ith element. In calculating weight or volume 
of the structure, element direct stresses, ele¬ 
ment bending stresses, area of cross section, 
and the section modulus of each element 
must be known. Also, in order to calculate 
the allowable compressive stress for an ele¬ 
ment, its least radius of gyration must be 
known. These quantities are required as con¬ 
tinuous functions, rather than discrete num¬ 
bers, in the present formulation. Since the 
moment of inertia of each element is its only 
design variable, the quantities area of cross 
section, section modulus, and the least radius 
of gyration must be expressed in terms of the 
moment of inertia of the element. These 
relationships of the ith element are written as 
follows: 

A, = (5-98) 


Z, = (5-99) 

and 

r, =d,b*'* (5-100) 

where is the area of cross section, is the 
section modulus, is the least radius of 
gyration of the ith element of rigid frame, and 
a^, Cf, and are constants. These constants 
can be found by plotting curves of area of 
cross section, section modulus, and the least 
radius of gyration versus the moment of 
inertia of various economical beam and 
column sections. These curves have been 
drawn by Nakamura (Ref. 28) for wide flange 
sections of AISC Steel Construction Manual 
(Ref. 27), and the same values are used in this 
handbook. This approach of obtaining con¬ 
tinuous relationship for area of cross section, 
section modulus, least radius of gyration, and 
the moment of inertia has also been used by 
other researchers in their work (Refs. 29, 30, 
31). 

The objective function, Eq. 5-1, for this 
problem is again taken as the total weight of 
the frame which may be written as 

•'== p/V/V'* (5-101) 

The displacement method of structural analy¬ 
sis is used, and nodal displacements of the 
frame are considered as basic state variables. 
Therefore, the /th component of the state 
variable represents the /th displacement com¬ 
ponent of the frame. Fig. 5-24 shows a simple 
scheme for designating joints, members, and 
displacement components of a frame in the 
structure coordinate system. Fig. 5-25 shows 
a frame element in the member coordinate 
system with the sign convention to be used on 
element forces and deformations. It may be 
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Figure 5-24. Description of a Frame 



noted that F[ and F\ are direct forces on the 
element, F^ and F/ are shearing forces, and 
Fl and F^ are the moments at the end of an 
element. The structural analysis equations 
developed in par. 5-4 and in Appendix B are 
also used here. The element forces are com¬ 
puted from Eq. 5-80 which may again be 
written as follows: 

F = Sz ( 5 - 102 ) 

where 

S^K{b)fi (5-103) 

Dimensions of the matrices K{b) and fi are 
adjusted for the case of frames and F is a 
vector which consists of element forces for all 
the elements of the frame. The stiffness and 
mass matrices for the frame element arc 
different from the truss element. They arc 
given by the following matrices: 


- 


afi. 


- 1/2 


-afi. 


- 1/2 


0 

12//,^. 

6/£< 

0 

niL] 


0 

6/£, 

4 

0 

-6/£, 

2 


afi^ 


0 

0 


- 1/2 


a^b^ 


0 

0 


- 1/2 


0 

niL] 

6IL, 

0 

niL, 

6/1, 


0 

6IL. 

2 

0 

6/Z-, 

4 


(5-104) 
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140 

0 

0 

70 

0 

0 


0 

156 

221 

0 

54 

- 131 

_ p/ 

0 

22L 

41* 

0 

131 

-31* 

420 

70 

0 

0 

140 

0 

0 


0 

54 

131 

0 

156 

- 221 


_ 0 

- 13i 

-31* 

0 

-22L 

1_ 


(5-105) 


Now, as before, the matrices and of 
Eqs. 5-20 and 5-21, respectively, must be 
computed in order to apply the algorithm of 
par. 5-3.2. They can be readily assembled 
once various other matrices have been com¬ 
puted. Let us first consider computation of 
the matrix 9.^ of Eq. 5-20. The matrix 
which is computed from externally applied 
loads, is independent of the design variable 
vector b if the self weight of the elements is 
neglected. This implies that bf[b)lbb = 0. 
Also, from Eq. 5-101 one obtains 

.. 

( 5 - 106 ) 

-^=(0,.0), (5-107) 

dz 

and 3//3f = 0. Eq. 5-14 now yields, = (0, 

.. 0)^. Substituting these values into Eq. 

5-20, one obtains 

.. 

( 5 - 108 ) 

Next, consider computation of the matrix 


9^ of Eq 5-21. It requires formation of the 
constraint vector ^ by considering various 
constraints and computation of matrices such 
as 3^/3f, bf/bz, and bf/bb. The treatment of 
frequency, displacement, and design variable 
constraints in the case of a frame is exactly 
the same as in the case of a truss, which is 
developed in par. 5-4. So, these features will 
not be explained here, except for the fact that 
any point where a displacement constraint 
must be imposed is treated as a nodal point. 
Computation of matrices such as b/bb 
lK(b)z], b/bb lK{b)y], and b/bb lM(b)y] is 
also carried out in the way explained in par. 
5-4. The only constraint that remains to be 
considered is the stress constraint, which will 
be considered next. 

5-6.2 STRESS CONSTRAINT CALCULA¬ 
TIONS 

Let s denote the subscript for this con¬ 
straint. If one can compute b^Jb( and row 
vectors b^Jbz and bfjbb for this constraint, 
then he can assemble the matrix 9^ of Eq. 
5-21. 

The members of a framed structure are 
subjected to direct as well as bending stresses. 
Thus, the effect of combined stresses must be 
considered in implementing the stress con- 
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straints. It should be noted here that a clear 
distinction is made between the elements and 
the members of a frame. This distinction is 
necessitated by the fact that a member must 
often be divided into several elements for 
structural analysis and implementation of 
displacement constraints. On the other hand, 
the compressive stress for all elements making 
up a member is the same. In the present work, 
the members subjected to direct and bending 
stresses are required to satisfy the AISC 
specification (Ref. 27). The permissible stress, 
according to this Steel Construction Manual^ 
are: 


kL 

e=— (5-114) 

= allowable compressive stress 
E = Young’s modulus 

On the cross section of axially loaded col¬ 
umns when kLIr exceeds C^, 

^ 1.49 X 10* 

=-, Ksi (Kip/in? ). (5-115) 

(kLIrf 

4. Combined Stresses: 


1. Tension: 

^, = 0.60^^ (5-109) 

2. Bending: 

=0.66F^ (5-110) 

where F^ is material yield stress, F^ is 
allowable tensile stress, and F^ is allowable 
bending stress. 

3. Compression: 

On the gross cross-sectional area of axially 
loaded compression members, when kLIr, the 
largest effective slenderness ratio of any un¬ 
braced segment, is less than 


a. Axial Compression and Bending: 

Members subjected to both axial com¬ 
pression and bending stresses shall be pro¬ 
portioned to satisfy the following require¬ 
ments: 

(1) When^/F^ < 0.15, 

4-+-^ <1.0 (5-116) 

(2) When/./F, >0.15, 





0 

F.S. 


(5-111) 


where F.S. = factor of safety = 

5/3 + (3/8)«-(l/8)e3 (5-112) 


and, in addition at points braced in the plane 
of bending. 


0 . 6 /?^ 



.0 


(5-118) 


where 




(5-113) = axial stress that would be permitted 


5^0 
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if axial force alone existed 


= compressive bending stress that 
would be permitted if bending mo¬ 
ment alone existed 


K = 


1.49 X 1J» 


,Ksi 


(5-119) 


(In Eq. 5-119, for Fj. is the actual 
unbraced length (in.) in the plane of 
bending, is the corresponding radius of 
gyration (in.), and k is the effective length 
factor in the plane of bending.) 


4 = computed axial stress 

= computed compressive bending 
stress at the point under con¬ 
sideration 


- a coefficient whose value shall be 
taken as follows: 

(a) For compression members in frames 
subject to joint translation (side- 
sway): 

C^=0.85. (5-120) 

(b) For restrained compression members 
in frames braced against joint trans¬ 
lation and not subject to transverse 
loading between their supports in 
the plane of bending: 

C = 0.6 + 0 4 

but not less than 0.4, (5-121) 

where is the ratio of the 

smaller to larger moments at the 
ends of that portion of the member, 
unbraced in the plane of bending 


under consideration. M^IM 2 is posi¬ 
tive when the member is bent in 
single curvature and negative when it 
is bent in reverse curvature. 

(c) For compression members in frames 
braced against joint translation in 
the plane of loading and subjected to 
transverse loading between their 
supports, the value of may be 
determined by rational analysis. 
However, in lieu of such analysis, the 
following values may be used. 

For members whose ends are 
restrained: 

C = 0.85 

fn 

2. For members whose ends are 
unrestrained: 

=1.0 (5-122) 

b. Axial Tension and Bending 

Members subject to both axial tension and 
bending stresses shall be proportioned to 
satisfy the requirements of Eq. 5-118 where 
and are taken, respectively, as the 
computed and permitted bending tensile 
stress. 

Eqs. 5-116, 5-117 and 5-118 are known as 
the interaction equations. These equations, of 
course, are derived from the linear super¬ 
position of the direct stre^^ under axial load 
alone and the bending stress under bending 
moment alone. The factor is 

used in Eq. 5-117 to account for the magnifi¬ 
cation of the primary bending moment due to 
the axial load. This factor depends upon the 
type of loading and end conditions of the 
member. The value of the coefficient C can 

fn 
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be derived for various types of loadings and 
members, but the values recommended in 
Eqs. 5-120, 5-121, and 5-122 are conservative 
and are used in the present work. For a 
detailed development and discussion of these 
equations the reader is referred to Ref. 32. 

The allowable compressive stress formula, 
Eq. 5-115, is derived based on the basic 
theory of column buckling. It is obtained by 
dividing the Euler buckling stress by a factor 
of safety of 1.92. Therefore, £/[ 1.92 

X (kL/r)^ ] and, taking £ = 3.0 x 10^ Ksi, 

= 1.49 X \0^l(kL/r)^, Eq. 5-115 is applicable 
when the largest slenderness ratio kLIr is 
greater than or equal to C^. Experiments have 
shown that when kLir < , the values of the 

failure stress predicted by the Euler critical 
stress formula are seldom attained (Ref. 32). 
This is due to the presence of residual stresses 
and other imperfections in fabrication of the 
members. Therefore, when kL/r < the 
values of the allowable stress are found 
from Eq. 5-111 which is derived based on the 
parabolic approximation of the curve-critical 
stress F^ versus the slenderness ratio kLir in 
the range kLJr < C^. This approximation is 
chosen based on the experimental results 
obtained at Lehigh University (Ref. 32). The 
value of the constant is found by assuming 
that the Euler critical stress formula holds 
until the critical stress is F^jl, Therefore, 

or 

where L and r must be expressed in the same 
units. 

The factor of safety is used to account for 
small imperfections of form and loading, and 
variations of support and restraint conditions 
from those assumed in computation, which 
cause the true effective length to be different 
from that calculated. The factor of safety 
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given by Eq. 5-112 includes an allowance for 
both of these factors and is adjusted to 
account for their varying influence. For short 
columns, Eq. 5-112 approaches the basic 
safety factor in tension (1.67); and, at e = 1, 
it becomes 15% higher (1.92), a value which is 
then used in the case when kljr exceeds C^. 
Eq. 5-112 is an approximation of a quarter 
sine wave between the two limits, the curve 
used in the specification as best representing 
the influence of the two factors. For a 
detailed discussion of the these factors, the 
reader is again referred to Ref. 32. 

The effective length factor k for each 
member of the frame is found from the 
differential equation 


d^y 

dx^ 



(5-123) 


where P is the buckling load, and / is the 
second moment of the cross-sectional area. 


The solution of this equation is given by 


y(x)^Di sin 



X + £>2 cos 



(5-129) 


In rigid frames, two cases must be discussed; 
(1) frames without sidesway, and (2) frames 
with sidesway. The transcedental equation 
that comes from Eq. 5-124, while satisfying 
the boundary conditions for a member of the 
frame without sidesway, is given by Ref. 32 

X sin (jr/fc) - Gg){-Klk)'^ + 2^ 

X cos {Ttfk) + 2 = 0 (5-125) 
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where and Gg are given by the following equations is used in finding the roots of Eqs. 
equations: 5-125 and 5-128 in the present work. 






(5-126) 


(5-127) 


The subscripts A and B refer to the two ends 
of the member under consideration and the 
subscripts c and b refer to the compressed and 
restraint members Respectively. For , the 
summations extend over all members that are 
connected to joint A and for Gg the summa¬ 
tions extend over all the members that are 
connected to joint B. So, for the first case of 
a frame without sidesway, the value of the 
effective length factor k must be found by 
solving the transcedental Eq. 5-125 for each 
member of the frame. 


For the second case, i.e., a rigid frame with 
sidesway, the transcedental equation that 
comes out of Eq. 5-124 - while satisfying the 
boundary condition for a member of the 
frame-is given by 


The interaction Eqs. 5-116, 5-117, and 
5-118 are implemented at the point of maxi¬ 
mum bending moment for an element. If 
there are no loads between the end points of 
an element, then the maximum bending mo¬ 
ment is at one of the ends; otherwise the 
actual point of maximum bending moment is 
found and the interaction equations are 
implemented there. As an example, consider 
the case of a uniformly distributed load on a 
frame element (Fig. 5-25); the moment at a 
distance x from the left end is given by 

wx^ 

— 2 - 

and 

-r-‘-Fl-wx = 0 (5-130) 

OX ^ 


or 


^2 


(5-131) 


Therefore, from Eq. 5-129 

= -F^+ (Fp^l(2w) . (5-132) 


[C^ Gg (ir/k)^ - 361 sin (jr/fc) 

- 6((?^ + Gg )iir/k) cos ( ir/k) = 0, 

(5-128) 

where G^ and Gg are given by Eqs. 5-126 
and 5-127, respectively. Thus, for the case of 
a frame with sidesway, Eq. 5-128 must be 
solved for k for each member of the frame. 
The secant method of nonlinear algebraic 


Eq. 5-132 is used in computing the maximum 
bending stress required in the interaction 
equations. Now, the implementation of the 
interaction equations will be considered one 
by one and the vectors and b^Jbb will 

be computed in each case. For the sake of 
simplicity, let N be the direct force on the 
element, M be the maximum bending 
moment, k. be the effective length factor, and 
Lf be the length of the member to which the 
fth element belongs. 
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b. Interaction Equations: 

(1) Interaction Eq. 5-116: 

Eq. 5-116, for the maximum stress in the 
ith element, can be written as 


N 








-1.0<0. (5-133) 


where is the beam bending stiffness. 

In case Eq. 5-133 is violated, then one must 
compute A0^, and 30^/33. 

Therefore, from Eq. 5-133 b^Jbz = 0 and 

/ ^ ^max \ 


Differentiating Eq. 5-133 with respect to z, 
1 bN 1 3Af_ 


d0, = 


bz bz 


(5-135) 


The value of ^M^^^jbz depends on the 
expression that defines If ^ oc- 

ftiox Max 

curs at an end of the element, then 

(5-136) 



■FiorFi 

and 



bF*3 3Fi 

bz 



occurs at a I 


(5-137) 


rriifA - 

ends, then Eq. 5-132 gives its value, and 




bz 


bz w bz 


(5-138) 


Again, differentiating Eq. 5-133 with respect 
to b. 


30, 1 fbN N bbl\ 

^'’A^F^\^~ 2bf bb) 


N 3F, 1 


I—-i^rW 


The value of can be found from 

Eq. 5-136 or Eq. 5-132, which are as follows: 



dFi 

dFi 

db 

“ir" 



dFi 

Fi dFi 

db 

"aT 

w bb 


(5-140) 


The value of bFJbb required in Eq. 5-139 
is found from Eq. 5-111 or Eq. 5-115. First, if 
< C^, then Eq. 5-111 gives the value of 


Pa 


bP. Fye, r 3 

db ~4b,{F.S.) L''^8(F.S.) 

r 

A:,(F.S.) \_‘f 


8 (F.S.) 




where 




(5-143) 
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When Eq. 5-115 gives the 


value of F and 
o 


p. 2F bk, 

— —-(5-144) 

bb 2bf db bb 


Substituting the appropriate expressions in 
Eq. 5-139, the value of b^Jbb can be found. 

(2) Interaction Eq. 5-117: 

Eq. 5-117 for the maximum stress in the 
fth element can be written as foUows: 


N 

m fW% fwmmX ^ ^ ^ MV 

«^=-—+-;-1.0 <0 (5-145) 




db, 

— ^r-r^ 2r < ^ 

\pi tl/2 |_4&, ' db 

1 dN 2N dit^l 

“ lb ~ k,A,F^ db J 


where 
^2 = 


t^) 


(5-150) 


A,P, 


where 




(3) Interaction Eq. 5-118: 

Next, consider Eq. 5-118, which may be 
written as follows for the maximum stress in 
the ith element 


(5-146) 


N 




If this constraint is violated, then one must 
compute A0^, b^Jb^, b^Jbz, and b^Jbb. In 
this case b^Jb^ = 0 and 


K 


O.eF^A, Z,F^ 
Therefore, d^Jd^ = 0 and 


- 1.0< 0. .(5-151) 


/ N \ . t ^ ^max \ 

A4 = _(-+ - 1.01. (5-147) A« = -( ■ +-=-^ - l O). (5-152) 

Y,Pa / \0.6Fy^, Z,F„ ) 


Differentiating Eq. 5-145 with respect to z 
and b, one obtains 


Differentiating Eq. 5-151 with respect to z 
and b, one obtains 


37V 

bz 




1 dN 1 dM, 


H, _ 

dz 0.6F A, dz Z,F,^ dz 


(5-153) 


(5-148) 


and 


and 

~db 


d^, 1 /dN N db,\ N dF^ 

^~A^^ ~2b, IbjA,!^^ db 


I, 1 /dN N db,\ 

r ~oJF^,^r ~lb,^J 

Z,f\ db 4b, db/ 


(5-154) 
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It may be noted in the previous equations 
that 

bb, (/th) 

1,0,.,0) (5-155) 

The value of the vectors such as bN/bz, 
bNIbb, bFi Ibz, bF^ Ibb can be found directly 
from the Eq. 5-102. The vectors bC^/bz and 
bCm Ibb are zero for cases prescribed by Eqs. 
5-120 and 5-122. For the case prescribed by 
Eq. 5-121, they are computed using the chain 
rule of differentiation. It remains to find 
value of the vector bkjbb. This vector can be 
computed by differentiating Eq. 5-125 or Eq. 
5-128 with respect to the design variable 
vector b. However, due to the fact that both 
and are functions of b, this computa¬ 

tion is quite tedious and time consuming on 
the computer. Another approach that may be 
followed for computing bkjbb is to use the 
method of finite differences; but this ap¬ 
proach is equally time consuming on the 
computer. Moreover, it has been observed in 
the numerical computation that the value of 
k^ does not change appreciably from one 
design cycle to another. Therefore, without 
significant loss of accuracy, the value of k. in 
a particular design cycle is treated as a 
constant. However, at the start of each design 
cycle, k values for all the members of the 
frame are recomputed. Thus, following this 

procedure, bk^/bb = (0,., 0). Now, all the 

necessary information is available to assemble 
matrix A of Eq. 5-20. 


5-6.3 EXAMPLE PROBLEMS 

Several rigid frames were optimized using 
the computer program based on the algorithm 
of par. 5-3.1. All the problems were solved 
with stress, displacement, frequency, and de¬ 
sign variable constraints. Example problems 
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5-4 and 5-5 that follow also are treated in 
Ref. 28 and were first designed for only stress 
constraints in order to compare results with 
those of Ref. 28. 

1. Example 5-4. Simple Portal Frame 

Fig. 5-26 shows the dimension of the 
frame. The moment of inertia for each ele¬ 
ment of the frame is treated as an unknown, 
and the results obtained are shown in Table 
5-12. The frame was first designed with only 
stress constraints. The final weight in this case 
was 3050.5 lb with a computation time of 
3.74 sec for 13 cycles. At the final design 
point, the maximum constraint violation was 
0.012% for stress in element 2. Optimal 
weight reported in Ref. 28 was 3206 lb, 
which is higher by approximately 5%. 

The frame was also designed by including 
all the constraints. The resonant frequency 
limit for the structure was 25.0 Hz and the 
final weight obtained in this case was 3803.0 
lb with a computation time of 14.60 sec for 
31 iterations. At the final design point, the 
maximum constraint violation was 0.0073% 
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Figure 5-26, Simple Porta! Frame (Example 
5-4) 
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TABLE 5-12 

SIMPLE PORTAL FRAME (EXAMPLE 5-4) 

Design Information: For each element of the frame* the modulus of elasticity* the specific weight* and the yield 
stress are 3 X 10^ kips/in.^* 0.2836 Ib/in.^* and 36.0 kips/in.^* respectively. The constants Cj, and dj are 0.58* 
0.58* arni 0.67* respectively. The lower limit on the moment of ir^ertia of each element is 1.0 in.^ and there is no 
upper limit. The resonant frequency is 25.0 Hz and the displacement limits are 0.5 in. at nodes 2* 3* and 4 in both 
Xf and X 2 -directions. There are three loading corniitions for the frame; first is uniformly distributed load of - 0.5 
kip/in. on elements 2 and 3* secorni is a load of 45.0 kips in x-direction at node 2* and the third is a load of - 45.0 
kips in X| -direction at node 4. 


With Only Stress Constraints 
Computation time * 3.74 sec 

With All the Constraints 
Computation time * 14.60 sec 

EL 

Starting Values* 

Final Values* 

El. 

Starting Values* 

Final Values* 

No. 

in/ 

in/ 

No. 

in/ 

in/ 

1 

1600.0 

1091.4 

1 

1600.0 

1995.5 


1600.0 

768.3 


1600.0 

860.3 


1600.0 

768.3 


1600.0 

860.3 


1600.0 

1091.3 

■■ 

1600.0 

1995.5 

Wt. lb 

3947.7 

3050.5 

Wt* lb 

3947.7 

3803.0 


for stress in element 2. Fig. 5-27 shows the 
variation of the objective function as the 
iterations progress. It may be noted that, for 
practical purposes, convergence was obtained 



Iteration Number 

Figure 5-27. Iteration i/s Weight Curves for 
Example 5^, Simple Porta! 
Frame 


in only 5 iterations in the first case and in 7 
iterations in the second case. However, in the 
second case, the cost function continued to 
reduce for a few cycles beyond the 7th 
iteration without correcting the constraints. 
This was due to the fact that the step size for 
the problem was too large. 


2. Example 5-5. One-bay Two-story Frame 

The present example is also treated in Ref. 
28. Fig. 5-28 shows the dimensions and the 
loading conditions for this structure. Input 
and output information for this example is 
given in Table 5-13. This frame was first 
designed for stress constraints only. The final 
weight in this case was 8292.0 lb with a 
computation time of 21.47 sec for 32 itera¬ 
tions. Maximum constraint violation at the 
design point in this case was 0.27 percent for 
stress in element 3. The comparable fmal 
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weight reported in Ref. 28 was 8810 lb, 
which is again higher by approximately 5.8%. 



Figure 5-28. One-bay, Two-story Frame 
(Example 5-5) 


The frame was also designed while en¬ 
forcing all constraints. The resonant frequen¬ 
cy limit in this case was 15.0 Hz. The final 
weight obtained in this case was 9722.5 lb 
with a computation time of 48.84 sec for 32 
iterations. Maximum constraint violation was 
0.38 X 10"^% for displacement of node 3 in 
the Xj-direction. Fig. 5-29 shows variation of 
the cost function with respect to iteration 
number, and it may again be noted that 
convergence was obtained in 8 cycles in both 
the cases. 

3. Example 5-6. Two-bay Six-story Frame 

Figure 5-30 shows the geometry and 
dimensions of the frame. This frame has 21 
joints, 30 members, and 54 degrees of free¬ 
dom. The frame was designed for four loading 
conditions, and the input and output informa- 


TABLE 5-13 

ONE-BAY, TWO-STORY FRAME (EXAMPLE 5-5) 

Design Informethn: For each element of the frame, the modulus of elasticity, the specific weight, and the yield 
stress are 3 X 10^ kips/in.^, 0.2836 Ib/in.^, and 36.0 kips/in.^, respectively. The constants#^ c^, and an 0.58, 
0.58, and 0.67, respectively. The lower limit on the moment of inertia of each element is 1.0 in.^ and there is no 
upper limit. The resonant frequency for the frame is 15.0 Hz and the displacement limits are 1.0 in. at nodes 2, 3,4, 
5, 6, and 7 in both X] -and X 2 -diractions. There are three loading conditions for the structure, and they are as shown 
on Fig. 5-28. 


With Only Stress Constraints 


With All the Constraints 

Computation time * 21.47 sec 

Computation time * 48.84 sac 

El. 

Starting Values, 

Final Values, 

El. 

Starting Values, 

Final Values, 

No. 

in.^ 

fci/ 

No. 



1 

6400.0 

3264.8 

1 




6400.0 

901.4 





6400.0 

801.5 





6400.0 

801.5 



845.7 


6400.0 

2598.7 



4618.8 


6400.0 

2598.7 



4618.8 


6400.0 

901.4 


6400.0 

1436.3 

8 

6400.0 

3267.4 

8 

6400.0 

3794.0 

Wt,lb 

15790.8 

8292.0 

Wt, lb 

15790.8 

9722.5 
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Iteration Number 

Figure 5-29. Iteration Number vs Weight 
Curves for Example 5-5; One- 
bay, Two-story Frame 


tion for the problem is given in Table 5-14. 
The frame was first optimized by imposing 
the stress constraint only. The optimum 
weight in this case was 21706.6 lb with a 
computation time of 8.32 min for 21 itera¬ 
tions. At the final design point, the maximum 
constraint violation was 0.025% for stress in 
element number 25. Next, the frame was 
designed by imposing all the constraints. The 
optimum weight in this case was 24290.1 lb 
with a computation time of 8.7 min for 20 
iterations. At the final design point, the 
maximum constraint violation was 0.0072% 
for displacement in the -direction at node 
1. All other violations were less than that. 

Fig. 5-31 shows variation of the cost 
function with respect to the iteration number. 
The starting point in this case was quite a 
distance away from the optimum point. 
Therefore, a larger step size was used in the 
first few iterations. Also, it was observed from 


the first few iterations that reductions in the 
values of the design variables for elements 23, 
24, 25, 28, 29, and 30 were relatively smaller 
than those of other elements. This is due to 
nature of the gradient of objective function 
for this problem (Eq. 5-106). So, the values of 
these design variables were reduced con¬ 
siderably at the 7th iteration. This is shown 
by the vertical drop in the graph at the 7th 
iteration on Fig. 5-31. In the second case, 
where all the constraints were considered, 
variation of the cost function with respect to 
the iteration is shown in Figure 5-32. In this 
case, the starting point was infeasible and the 
convergence was obtained in 8 iterations. 


^2 



Figure 5-30. Two-bay, Six-story Frame 
(Examine 5-6) 
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TABLE 5-14 

TWO-BAY. SIX-STORY FRAME (EXAMPLE 5-6) 

Design Information: For each element of the frame the nrKxiulus of elasticity, the specific weight, and the yield 
stress are 3 X 10^ kips/in.^. 0.2836 Ib/in.^. and 36.0 kips/in.^. respectively. The constants and are 0.58. 
0.58, and 0.67, respectively. The lower and the upper limits on the moment of inertia of each element are 394.5 in.^ 
and 6699.0 in.^. respectively. The resonant frequency of the structure is taken as 4.0 Hz and the displacement limits 
are 2.0 in. at all nodes in both xi - and -directions. There are four loading conditions for the frame: (1) Uniform¬ 
ly distributed load of — 4.0 kips/ft on element 1. 7.11. 17. 21. and 27. and — 1.0 kip/ft on elements 2. 6. 12.16.22. 
and 26; (2) Uniformly distributed load of — 4.0 kips/ft on elements 2. 6. 16. 22. and 26. and — 1.0 kip/ft on 
elements 1. 7. 11. 17. 21. and 27; (3) Uniformly distributed load of - 1.0 kip/ft on elements 1. 2. 6. 7.11.12. 16. 
17. 21. 22. 26. and 27. and loads of 9.0 kips each at nodes 1. 4. 7. 10. 13. and 16 in direction of the x-axis; (4) 
Uniformly distributed load of - 1.0 kip/ft on elements 1. 2. 6. 7. 11. 12. 16. 17. 21. 22. 26. and 27. and loads of 
- 9.0 kips each at nodes 3.6. 9.12.15. and 18 in direction of Xi - axis. 


With Only Stress Constraints 


With All Constraints 

Computation time * 8.32 min 

Computation time ■ 8.70 min 

El. 

Starting Values. 

Final VakMt, 

El. 

Starting Values. 

Final Values. 

No. 

In/ 

HI. 

No. 

in.^ 

in.^ 

1 

2400.0 

450.6 

1 

394.5 

473.8 

2 

2400.0 

450.6 


394.5 

473.8 

3 

2400.0 

498.6 


394.5 

467.2 

4 

2400.0 

394.9 


394.5 

437.5 

5 

2400.0 

498.6 


394.5 

467.2 

6 

2400.0 

530.8 


394.5 

568.7 

7 

2400.0 

530.8 


394.5 

569.1 

8 

2400.0 

394.3 

8 

394.5 

394.5 

9 

2400.0 

397.1 

9 

394.5 

608.5 

10 

2400.0 

394.1 

10 

394.5 

394.5 

11 

3200.0 

481.8 

11 

450.0 

787.0 

12 

3200.0 

481.7 

12 

450.0 

786.4 

13 

3200.0 

425.3 

13 

400.0 

412.7 

14 


472.7 

14 

450.0 

794.1 

15 


425.3 

15 

400.0 

412.6 

16 


521.9 

16 

550.0 

930.2 

17 


521.7 

17 

550.0 

930.0 

18 

4000.0 

468.3 

18 

550.0 

561.9 

19 


723.5 

19 

750.0 

920.4 

20 

4000.0 

467.5 

20 

550.0 

561.8 

21 

4800.0 

699.1 

21 


1019.1 

22 

4800.0 

699.1 

22 

600.0 

1018.7 

23 

4800.0 

646.3 

23 

700.0 

693.4 

24 

4800.0 

1044.5 

24 

1100.0 

1197.0 

25 

4800.0 

646.5 

25 

700.0 

693.3 

26 

5600.0 

666.4 

26 

600.0 

868.5 

27 

5600.0 

666.4 

27 

600.0 

867.9 

28 

5600.0 

1099.0 

28 

1200.0 

1245.4 

29 

5600.0 

1489.7 

29 

1600.0 

1658.6 

30 

5600.0 

1099.0 

30 

1200.0 

1246.3 

Wt.lb 

54290.9 

21706.6 

Wtjb 

21243.6 

24290.1 
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Iteration Number 

Figure 5-31. Iteration Number vs Weight Curves 
for Example 5~6; Two-bay, Six- 
story Frame, With Stress Constraints 
Only 



Figure 5-32. Iteration vs Weight Curves for 
Example 5-6; Two-bay, Six- 
story Frame, With AH Constraints 


5-7 INTERACTIVE COMPUTING IN 
STRUCTURAL OPTIMIZATION 

5-7.1 THE INTERACTIVE APPROACH 

Structural optimization techniques treated 
thus far consist of methods which seek to 
determine an optimum design, within a well- 
defined mathematical structure, by purely 
mathematical techniques. A second approach 
consists of providing the designer with an 
interactive computing tool with which he can 
try nominal designs, get rapid analysis feed¬ 
back, and alter his initial design based on his 
knowledge of structural behavior. Both meth¬ 
ods have been used with varying degrees of 
success on a variety of design problems. In 
general, the first approach has been used for 
problems with well-defined optimality cri¬ 
teria, such as minimum weight or maximum 
stiffness. The second approach has been used 
to aid designers in large scale structural design 
problems, primarily airframe design, such as 
the Air Force C-5 transport aircraft. 

The possibility of utilizing a combination 
of these two methods for structural design has 
been the subject of a recent paper (Ref. 36). 
This paragraph presents the specifics of appli¬ 
cation of the steepest-descent technique with 
designer interaction. This hybrid approach is 
appealing from a number of points of view. 
First, the problem of topological design, i.e., 
determination of optimum structural configu¬ 
ration, has been addressed with very limited 
success from an analytical point of view. 
Topological design, in practice, is done by 
experienced structural designers, occasionally 
with the aid of interactive computation. 
Combined analytical and interactive com¬ 
puting methods appear to be essential for this 
important class of problems. A second prob¬ 
lem area arises due to the difficulty in 
formulating a single optimality condition and 
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mathematically precise design constraints. 
Often, conflicting design constraints and ob¬ 
jectives arise during design which require 
experienced judgment and defy a priori math¬ 
ematical formulation. Such problems appear 
to require an interactive computing capability 
but should profit from analytical methods 
that are used in automated structural optimi¬ 
zation. 

Due to unavailability of a large scale, 
interactive system, the computations for this 
study were simulated. Instructions were pre¬ 
pared and computations were run in the batch 
mode. Output data were then displayed and 
analyzed just as they would be in the inter¬ 
active mode, and instructions for recomputa¬ 
tion were given by the designer and the 
process repeated. The delay in designer inter¬ 
action is felt to degrade performance some¬ 
what, over true interactive computing, since 
the designer tends to forget pertinent detailed 
data during the time delay. For this reason, 
the results of this study should provide a 
conservative estimate of the designer’s per¬ 
formance in a truly interactive mode. 

5-7.2 INTERACTIVE STRUCTURAL DE- 
SIGN USING SENSITIVITY DATA 

The steepest-descent optimization method 
developed in this Chapter has been used to 
solve a number of relatively large scale struc¬ 
tural optimization problems with good 
success. All these problems, however, have 
been well formulated mathematically and 
have involved structures with a predetermined 
form. Difficulties have occurred when certain 
structural elements tend toward zero cross 
section. Further, no universal method has 
been found to determine the best step-size r? 
in the optimization algorithm. These and 
other inherent difficulties in automated opti¬ 
mization lead one to inteiject an experienced 
designer into the computational, optimization 
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algorithm. The result is a hybrid structural 
optimization technique. 

Reconsidering the design improvement step 
of the optimization algorithm, one might 
draw a vector picture in design space, as is 
depicted in Fig. 5-33. Here, - ridb^ is the 
direction which will yield the greatest re¬ 
duction in J subject to the required con¬ 
straints, and db^ is the design change required 
to give the desired constraint error correction. 
While useful in this form, there is a better 
display of these data for use by the experi¬ 
enced structural designer. The scalar com¬ 
ponents of - db^ and 6b^ tell the designer 
whether he should increase or decrease his 
individual design variables to obtain desirable 
changes in overall structural response. Fur¬ 
ther, relative importance of design variable 
changes is given. For this reason, 8b^ may be 
interpreted as a vector of design sensitivity 
coefficients that relate individual design 
parameter changes to overall structural 
characteristics. It is extremely important to 
note, at this point, that these sensitivity 
coefficients account for constraints implicitly; 
i.e., the direction of change indicated in the 



Figure 5-33. Vector Change in Design 
Space 
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design parameters will not cause signiflcant 
violation in specified performance constraints 
such as stress limits and deflection limits. 

To illustrate these ideas, consider the 
simple structural design problem in Fig. 5-34. 



Figure 5-34. Three-bar Truss 

The cost function here is structural weight. If, 
for example, the stress in member 1 is at its 
allowable limit under one of the loads, then 
the indicated changes in design (- 6bi, 
- 6 ^ 2 . “ 5 ^ 3 ) will not increase the stress in 
member 1. To make the design sensitivity 
data of maximum use to the designer, con¬ 
sider the graphical display in Fig. 5-35. In this 
di^lay, are the stresses in the various 
members. This display gives the experienced 



Figure 5-35. Display of Design Sensitivity 
Data 


designer a clear picture of the manner in 
which he should change his design parameters 
to reduce total weight, subject to stress 
constraints. He can now choose the desired 
reduction A 7 in weight and take the resulting 
design change 5b, or if he wishes, he can input 
modified design changes through an inter¬ 
active computer terminal. 

There are a number of other respects in 
which this mode of designer interaction with 
the computer alogrithm is beneficial. First, it 
often happens in the automated use of the 
algorithm that oscillation of admissible de¬ 
signs occurs because too large a design im¬ 
provement has been requested. Such oscilla¬ 
tion can often be identified by the designer 
after only a few iterations and the step size 
can be reduced to prevent loss of computer 
time, which can be significant in large scale 
problems. Conversely, if an estimate quite far 
from the optimum is chosen to initiate the 
algorithm, it often happens that the designer 
chooses far too smaU a step size. The result is 
a very small improvement in the design which 
can be sensed by the designer and improved 
before excessive computation time is ex¬ 
pended. 

A second important benefit from designer 
interaction with the algorithm arises due to 
the occurrence of local minima and singu¬ 
larities in the analytical formulation of the 
design problem. The problem of local minima 
is illustrated by Fig. 5-36. Virtually all opti¬ 
mization methods seek local optima and do 
not solve the global optimization problem. It 
is easy for an optimization technique to get 
hung up at point B and not get to point A, 
which is the global minima, so the designer 
must try different starting points to obtain 
the global solution. This is a very time 
consuming and indefinite technique with very 
few analytical aids to the designer. Part of the 
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difficulty here arises because Figure 5-36 is 
the wrong display for the designer in that it 
does not utilize his knowledge and experience 
with structures. 

A much better approach for the designer is 
to look at a display such as Figure 5-35. He 
can use his experience to restart the optimiza¬ 
tion algorithm at a meaningful distribution of 
design variables which may be quite different 
from the design which resulted from previous 
calculations. His experience, thus, aids him in 
starting with different trial designs. 

Perhaps even more important than trying 
various distributions of design variables, the 
designer can utilize the display of Fig. 5-35 to 
change the configuration of the structure 
based on information he accumulates during 
iterative design and based on his experience. 
For example, he might try taking member 2 
out of the structure and optimize based on 
the modified configuration. Very often, 
significant gains are made in this manner. 
Precisely this behavior occurs in the three 
member truss being considered. 

There are actually compelling mathematical 
reasons for allowing the designer to make 
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changes in configuration as outlined. There 
are no general optimization methods, to date, 
which will remove a member during iterative 
design. The reason is that as a member cross 
section goes toward zero, as is required to 
remove a member, the equations of structural 
mechanics and stress constraints become 
singular. This sort of behavior is typical when 
the configuration of a system is changed and a 
different set of equations is required to 
describe the behavior. At the present time, 
allowing the designer to make changes in 
configuration appears to be the most feasible 
approach, which requires that he play an 
active role in the iterative optimization al¬ 
gorithm. 

5-7.3 EXAMPLE PROBLEMS 

I. Example 5-7. A Three-member Truss 

As an illustrative example of the technique 
presented in par. 5-7.2, an elementary opti¬ 
mal design problem will be solved under a 
number of loading conditions and a variety of 
constraints. The effect of designer-computer 
interaction on rate of convergence is ex¬ 
amined as well as the effect of changing 
structural configuration. 

Figure 5-37(A), shows the geometry and 
dimensions of the structure being considered. 
This structure has been studied by Schmit 
(Ref. 37), Sved and Ginos (Ref. 38), and 
Corcoran (Ref. 35). Three independent load¬ 
ing conditions are applied to the structure. 
These are as follows: 40K at 45 deg; 30K at 
90 deg; 20K at 135 deg. The allowable stress 
level for members 1 and 3 is ± 5 Ksi and for 
member 2 it is ± 20 Ksi. The density of the 
material is taken as 0.10 Ib/in.^ and Young’s 
modulus as 10^ Ksi. Starting from the feasible 
solution, bi = 8.0, bj = 2.4, b^ = 3.2, Schmit 
(Ref. 37) arrived at the solution bi = 7.099, 
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(B) CotcoranTiun 

Figure 5-37, Trusses (Example 5-7) 


b 2 = 1.849, ^3 = 2.897, for which /= 15.986 
lb. Sved and Ginos (Ref. 38) have shown that 
this is only a local minima and by omitting 
member 3, they obtained the solution as 
bi =8.5, b 2 = 1.5 with /= 12.812 lb. They 
have also shown that it is impossible to reach 
this minimum by an iterative optimization 
method unless member 3 is omitted from the 
calculations by the designer. Corcoran (Ref. 
35) has considered configurational optimiza¬ 
tion of this three-bar truss. By using horizon¬ 
tal coordinates of nodes 1, 2, and 3 also as 
design variables, he arrived at an optimum 
structure shown in Fig. 5-37(B). As a result of 
this configurational optimization procedure, 
members 1 and 3 were combined and their 
orientation is shown by member 1 of Fig. 
5-37(B). Member 2 attained an orientation as 


shown in this figure. The final solution 
obtained by Corcoran was hi =4.241, 
b 2 = 2.038 with / = 7.55 lb. 

Considerable experimentation was done 
with this problem. Starting from a feasible 
point hi = 10, ^2=5, ba = 5, the solution 
obtained without interaction was bi = 7.064, 
ba = 1.971, ba = 2.835 and the minimum was 
J = 15.97 lb. The variation of weight with 
respect to iteration number is shown by C!urve 
1, Fig. 5-38. Next, by adjusting the step size 
in interactive computing, the solution was 
obtained in only five iterations. This is shown 
by (Turve 2, Fig. 5-38. It was observed that 
member 2 never reached its allowable stress 
level. As a second starting point, the area of 
member 2 was initially chosen to bring its 
stress to the allowable limit. The minimum 
reached in this case was the same as before. 
Curve 3, Fig. 5-38. Another solution was 
obtained by starting from an infeasible point 
bi = 5.0, ba = 1.5, ba = 0.10. The solution in 
this case was bj = 6.98, b 2 = 2.30, ba = 2.68 
with/= 15.97 lb. Curve 4, Fig. 5-38. 

Next, member 3 was omitted from the 
structure. Starting from a point bi = 10, 
ba = 5, the solution obtained was b, = 8.0, 
b2=1.5 with /= 12.812 lb, Curve 1, Fig. 
5-39, which is same as reported in Ref. 38. At 



Iteration Number 

Figure 5-38. iteration vs Weight Curves for 
Example 5-7, Three-bar Truss 
With Stress Constraints Only 
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Figure &39. iteration vs }Neight Curves for 
Example 5-7, Three-bar Truss 
With AH Constraints 


this point an interesting observation was 
made. The maximum horizontal and the 
vertical deflections of node 4 were as follows: 
with three bars, Zi =0.689 x 10”^ in., 
2 2 = 0.595 X 10”^ in.; with two bars, 
2 i = 0.239 X 10“^ in., and Z 2 = 0.20 x 10“Mn. 
Thus, although the optimum weight obtained 
by omitting member 3 is approximately 24% 
lower than the weight obtained by including 
member 3, the deflections of node 4 in the 
former case were approximately four times 
greater than in the latter case. 

One might be led to believe that if 
deflection or frequency constraints were en¬ 
forced, then the optimum structure might not 
be statically determinate. To investigate this 
possibility, displacement as well as buckling 
and natural frequency constraints were im¬ 
posed. The deflection limits were taken as 
2 i = ± 0.005 in. and 22 =±0.005, and the 
lower limit on natural frequency was taken as 
3830 Hz. With the starting point hi = 10, 
hi =5, ha = 5, the solution obtained was 
hi =9.18, h2=2.16, ha =3.85, and 

/= 20.59 lb. Curves 2 and 3, Fig. 5-39. When 
member 3 was omitted, the starting point was 
taken as hi = 10, ha = 10, Curve 4, Fig. 5-39, 
and as hi = 18, ha = 10, Curve 5, Fig. 5-39. 
The solution obtained in this case was 
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hi = 16.0, ha = 11.31, and/= 33.94 lb. Thus, 
the optimum weight obtained for this statical¬ 
ly determinate case is approximately 70% 
higher than the optimum weight obtained for 
the statically indeterminate case. 

It was found that interactive computing 
yielded convergence more rapidly than was 
the case in the batch mode. It is expected that 
even more significant reduction in computing 
time will occur in large scale problems. 

This problem was also solved by omitting 
member 2 from computation. The results 
obtained in this case are given in Columns 3 
and 7 of Table 5-15. The truss optimized by 
Corcoran (Ref. 35) was also solved here by 
first imposing the stress constraints only and 
then by considering all the constraints. The 
results of these cases are given in Columns 4 
and 8 of Table 5-15. 

The key point in the solution is that the 
configuration of the optimum design is not 
obvious from analytical considerations. A 
designer’s experience and insight are required 
to select candidate configurations and then 
obtain the optimum design analytically. The 
global solution in this case must be chosen by 
comparing relative minima. It may be ex¬ 
pected, in structures with greater redundancy, 
that certain members may be removed during 
interactive computation when they are ob¬ 
served to approach their allowable lower 
limits. 

An interesting point, illustrated by Table 
5-15, is that a statically determinate truss is 
optimum when only stress constraints are 
imposed. Quite the contrary, when the full 
range of constraints are imposed, a statically 
indeterminate truss is optimum (not consider¬ 
ing the configurational optimization). 
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TABLE 6-15 

OPTIMUM THREE-MEMBER TRUSSES (EXAMPLE 6-7) 


El 

With Only Strtsf Constraints 

With All Constraints 

No. 

Final Araa, in.^ 

Final Araa, in.'^ 


1 

2 

3 

4 

5 

6 

7 

8 

1 

7.064 

8.500 

7.991 

Corcoran 

Truss 

9.180 

16.00 

8.485 

Corcoran 

Truss 

2 

1.971 

1.500 

- 

4.246 

2.160 

11.310 

- 

4.247 

3 

2.835 

- 

4.243 

2.039 

3.850 

- 

8.485 

11.410 

Wt, lb 

15.970 

12.812 

17.300 

7.556 

20.59 

33.94 

24.000 

20.115 

Mm. 
Defl, in. 

0.00689 

0.02390 

0.00766 

0.02559 

0.005 

0.005 

0.005 

0.005 


2. Example 5-S. Transmission Tower 

Fig. 5-40 shows the geometry and dimen¬ 
sions of the transmission tower to be studied. 
This problem has been considered by 
Venkayya and others (Ref. 39). The tower 
has 25 members, 10 joints, 18 degrees of 
freedom, and is designed for 6 loading condi¬ 
tions. The structure is indeterminate, with a 
degree of indeterminacy of seven. 

The tower was designed by first imposing 
only stress constraints, and then by imposing 
stress, displacement, buckling, and natural 
frequency constraints. Design information is 
given in Table 5-16, and the final results 
obtained are shown in Tables 5-17 and 5-18. 
Table 5-17 shows the results when only stress 
constraints are considered, and Table 5-18 
gives the results for the corresponding cases 
when all the constraints are considered. For 
results given in Column 1 of Table 5-17, all 
the members of tower were included in the 


computation and the (Durve 1 of Fig. 5-41 
shows the variation of cost function with the 
number of iterations. The computations of 
this case were monitored to determine which 
cross sections went to their lower bounds. 

One set of members which attained their 
lower limits of cross-sectional area were num¬ 
bers 10, 11, 12, and 13. It was observed that 
these members carried small forces and could 
be removed without causing collapse of the 
tower, so they were removed from the tower. 
The final values of areas of cross section of 
the resulting structure are given in Column 2 
of Table 5-17. C^irve 2 of Fig. 5-41 shows the 
variation of cost function with respect to the 
design cycle. The final weight in this case was 
slightly less than the previous case. 

The next member that reached its lower 
limit was number 1, so it was also removed 
from the structure. The results of this case are 
given in Column 3 of Table 5-17 and Chirve 3 
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Figure &40, Transmission Tower (Example 5-8) 
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TABLE 5-16 

DESIGN INFORMATION FOR TRANSMISSION TOWER (EXAMPLE 5^) 

For each member of the structure, the modulus of elasticity the specific weight the constant a. (moment 
of inertia of /th member, stress limits are 10^ kips/in.*, 0.10 Ib/in.^, 1.0, and ± 40.0 kips/in.^, re¬ 

spectively. The lower limit on ^e area of cross section of each member is 0.10 in.^ for the case with stress 
constraints only and 0.01 in.^ for other cases. There is no upper limit on the member sizes. The resonant frequency 
for the structure is 173.92 Hz and the displacement limits are 0.35 in. on ail nodes and in all directions. There are six 
loading conditions and they are as follows (all loads are in kips): 


Load 


Diraction of Load 

Load 


Diraction of Load | 

Cond. 

Node 

X 

y 

z 

Cond. 

Noda 

X 

y 

z 


1 

1.0 

10.0 

-5.0 


1 

0 

10.0 

-5.0 

1 

2 

0 

10.0 

-5.0 

o 

2 

-1.0 

10.0 

-5.0 


3 

0.5 

0 

0 

z 

4 

-0.5 

0 

0 


6 

0.5 

0 

0 


5 

-0.5 

0 

0 


1 

1.0 

-10.0 

-5.0 


1 

0 

-10.0 

-5.0 

o 

2 

0 

- 10.0 

-5.0 

A 

2 

- 1.0 

- 10.0 

-5.0 

o 

3 

0.5 

0 

0 

•1 

4 

-0.5 

0 

0 


6 

0.5 

0 

0 


5 

-0.5 

0 

0 

c 

1 

0 

20.0 

-5.0 


1 

0 

-20.0 

-5.0 

o 

2 

0 

-20.0 

-5.0 

O 

2 

0 

20.0 

-5.0 


of Fig. 5-41. The final weight in this case was 
86.94 lb, which is given slightly less than the 
previous case. Finally, members 14, 15, 16, 
and 17 were at their lower limits of cross- 
sectional area. Removal of any of these 
members, however, would cause collapse of 
the structure. Members 2 and 5 or 3 and 4 
could be removed to make the structure 
determinate. The results for a statically 
determinate structure, obtained by removing 
members 2 and 5, are shown in Column 4 of 
Table 5-17. The final weight in this case was 
106.97 lb. It may be noted that this statically 
determinate structure yielded only a local 
optimum, Curve 4, Fig. 5-41. 

Another sequence of removing the mem¬ 
bers that reached their lower limits of area of 
cross section was also tried. Members 14, 15, 


16, and 17 reached their lower bounds but 
removal of all of these members rendered a 
structure that was geometrically unstable. 
However, members 14 and 16 or 15 and 17 
could be removed without causing the col¬ 
lapse of the structure. Results with members 
15 and 17 removed are given in Column 5 of 
Table 5-17 and similar results are obtained by 
omitting members 14 and 16 from the com¬ 
putation. The next set of members that were 
at their lower bounds and could be removed 
without making the structure unstable were 
numbers 1, 12, and 13. These were also 
removed from the structure and the results 
obtained in this case are given in Column 6 of 
Table 5-17. Two other members could be 
removed from the structure to make it 
statically determinate. Results obtained by 
removing members 4 and 5, and then numbers 
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Figure &40. Transmission Tower (Example 5-8) 
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TABLE 5-16 

DESIGN INFORMATION FOR TRANSMISSION TOWER (EXAMPLE 5-8) 

For each member of the structure, the modulus of elasticity E^, the specific weight the constant a. (moment 
of inertia of rth member, stress limits are 10^ kips/in.^, 0.10 Ib/in.^, 1.0, and ± 40.0 kips/in.^, re¬ 

spectively. The lower limit on the area of cross section of each member is 0.10 in.^ for the case with stress 
constraints only and 0.01 in.^ for other cases. There is no upper limit on the member sizes. The resonant frequency 
for the structure is 173.92 Hz and the displacement limits are 0.35 in. on all nodes and in all directions. There are six 
loading conditions and they are as follows (all loads are in kips): 



of Fig. 5-41. The final weight in this case was 
86.94 lb, which is given slightly less than the 
previous case. Finally, members 14, 15, 16, 
and 17 were at their lower limits of cross- 
sectional area. Removal of any of these 
members, however, would cause collapse of 
the structure. Members 2 and 5 or 3 and 4 
could be removed to make the structure 
determinate. The results for a statically 
determinate structure, obtained by removing 
members 2 and 5, are shown in Column 4 of 
Table 5-17. The final weight in this case was 
106.97 lb. It may be noted that this statically 
determinate structure yielded only a local 
optimum. Curve 4, Fig. 5-41. 

Another sequence of removing the mem¬ 
bers that reached their lower limits of area of 
cross section was also tried. Members 14, 15, 


16, and 17 reached their lower bounds but 
removal of all of these members rendered a 
structure that was geometrically unstable. 
However, members 14 and 16 or 15 and 17 
could be removed without causing the col¬ 
lapse of the structure. Results with members 
15 and 17 removed are given in Column 5 of 
Table 5-17 and similar results are obtained by 
omitting members 14 and 16 from the com¬ 
putation. The next set of members that were 
at their lower bounds and could be removed 
without making the structure unstable were 
numbers 1, 12, and 13. These were also 
removed from the structure and the results 
obtained in this case are given in Column 6 of 
Table 5-17. Two other members could be 
removed from the structure to make it 
statically determinate. Results obtained by 
removing members 4 and 5, and then numbers 
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TABLE 6-17 

OPTIMUM TRANSMISSION TOWERS WITH STRESS CONSTRAINTS ONLY 

(EXAMPLE &B) 



Final Araa, in.^ 

El. 









No. 

1 

2 

3 

4 

5 

6 

7 

8 

1 

0.100 

0.100 

_ 

— 

0.100 


m 

— 

2 

0.376 

0.377 

0.346 

- 

0.384 

0.364 


- 

3 

0.376 

0.377 

0.346 

0.100 

0.384 

0.366 


0.272 

4 

0.376 

0.377 

0.346 

0.100 

0.387 

0.363 

- 

0.272 

5 

0.376 

0.377 

0.346 

- 

0.385 

0.365 

— 

- 

6 

0.471 

0.470 

0.494 

0.779 

0.465 

0.484 

0.775 

0.779 

7 

0.471 

0.470 

0.494 

0.779 

0.463 

0.482 

0.779 

0.775 

8 

0.471 

0.470 

0.494 

0.779 

0.464 

0.481 

0.779 

0.779 

9 

0.471 

0.470 

0.494 

0.779 

0.463 

0.479 

0.779 

0.779 

10 

0.100 

- 

— 

- 

0.103 

0.103 

0.182 

0.182 

11 

0.100 

- 

- 

- 

0.103 

0.103 

0.182 

0.182 

12 

0.100 

- 

- 

- 

0.100 

- 

- 

- 

13 

0.100 

- 

— 

- 

0.100 

- 


- 

14 

0.100 

0.100 

0.100 

0.165 

0.151 

0.152 

0.302 


15 

0.100 

0.100 

0.100 

0.165 

- 

- 

- 

- 

16 

0.100 

0.100 

0.100 

0.165 

0.151 

0.152 


0.302 

17 

0.100 

0.100 

0.100 

0.165 

- 

- 

- 

- 

18 

0.277 

0.279 

0.292 

0.413 

0.278 

0.288 

0.413 

0.413 

19 

0.277 

0.279 

0.292 

0.413 

0.277 

0.288 

0.413 

0.413 

20 

0.277 

0.279 

0.292 

0.413 

0.274 

0.287 

0.413 

0.413 

21 

0.277 

0.279 

0.292 

0.413 

0.273 

0.287 

0.413 

0.412 

22 

0.380 

0.374 

0.363 

0.547 

0.445 

0.436 

0.669 


23 

0.380 

0.374 

0.363 1 

0.547 

0.334 

0.370 

0.447 

0.447 

24 

0.380 

0.374 

0.363 

0.547 

0.442 

0.436 

0.669 

0.669 

25 

0.380 

0.374 

0.363 

0.547 

0.336 

0.370 

0.447 

0.447 

Wtib 

91.13 

87.90 

86.94 

106.97 

89.94 

88.95 

113.69 

113.68 

Max. 

DaR. in. 

2.288 

2.305 

2.311 

3.489 

2.486 

2.453 

a614 

a615 


2 and 5 are given, respectively, in Columns 7 
and 8 of Table 5-17. Computations were also 
carried out by removing members 2 and 3, 
and members 3 and 4 along with members 1, 
12, 13, 15, and 17. Results obtained in these 
cases were the same as those shown in 
Columns 7 and 8 of Table 5-17. For this 
reason, these results are not reproduced here. 
Finally, another determinate structure, ob- 
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tained by removing members 1, 2, 5, 15, 16, 
19, and 20 was optimized. The cross-sectional 
areas of various members at the optimum 
point were as foUows: 3,4(0.100); 6 to 
9(0.779); 10,11 (0.182); 12,13(0.446); 
14,17(0.302); 18,21(0.775); 22,25(0.537); 
and 23,24(0.751). The optimum weight in 
this case was 118.1 lb and the maximum 
deflection at this point was 3.861 in. 
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TABLE 6-18 

OPTIMUM TRANSMISSION TOWERS WITH ALL CONSTRAINTS 
(EXAMPLE 5-8) 


El. 

Final Area, in.^ 









No. 

1 

2 

3 

4 

5 

6 

7 

8 

1 

0.010 

0.010 

— 

— 

0.010 

— 

— 

— 

2 

2.092 

2.339 

2.393 

- 

2.263 

2.389 

0.548 

- 

3 

2.075 

2.386 

2.404 

0.548 

2.264 

2.384 

0.548 

0.548 


2.096 

2.339 

2.393 

0.548 

2.021 

1.826 

- 

0.548 


2.083 

2.385 

2.404 

- 

1.920 

1.915 

- 

- 


2.367 

2.085 

2.076 

7.132 

2.389 

2.452 

6.596 

6.699 


2.354 

2.084 

2.076 

6.857 

2.186 

2.042 

6.483 

6.296 

8 

2.350 

2.113 

2.083 

6.895 

2.411 

2.430 

6.596 

6.686 

9 

2.335 

2.112 

2.082 

7.101 

2.096 

2.123 

6.476 

6.471 

10 

0.035 

- 

- 

- 

0.666 

0.621 

2.102 

2.054 

11 


- 

- 

- 

0.658 

0.630 

2.102 

2.047 

12 

0.087 

- 

- 

- 

0.090 

- 

- 

- 

13 


- 

- 

- 

0.071 

- 

- 

— 

14 

1.113 

1.114 

1.139 

1.786 

1.461 

1.485 

4.172 

4.101 

15 

1.113 

1.114 

1.139 

1.735 

— 

- 


- 

16 

1.112 

1.117 

1.146 

1.727 

1.438 

1.498 

4.170 

4.167 

17 

1.112 

1.117 

1.146 

1.798 

- 

- 

- 

- 

18 

2.056 

2.047 

2.027 

4.317 

2.161 

2.171 

4.692 

4.645 

19 

2.058 

2.034 

2.022 

4.390 

2.158 

2.173 

4.692 

4.664 

20 

2.046 

2.047 

2.027 

4.400 

2.403 

2.538 

4.985 

5.108 

21 

2.068 

2.034 

2.022 

4.328 

2.415 

2.524 

4.989 

5.038 

22 

2.822 

2.878 

2.886 

5.655 

4.187 

4.035 

6.746 

6.909 

23 

2.808 

2.878 

2.886 

5.730 

2.915 

2.873 

5.086 

4.781 

24 

2.803 

2.926 

2.895 

5.743 

4.124 

4.086 

6.743 

7.039 

25 

2.786 

2.926 

2.895 

5.648 

2.908 

2.881 

5.086 

4.749 

Wt, lb 

590.32 

596.64 

597.82 

1060.6 

625.37 

626.70 

1142.7 

1139.9 



MU 








0.350 


0.350 

0.350 

0.350 

0.350 

0.350 

0.350 


All these tower configurations were also 
optimized by imposing all constraints; i.e., 
stress, displacement, buckling, and natural 
frequency. The results of these cases are given 
in Table 5-18. Curves 1 to 4 of Fig. 5-42 show 
the variation of cost function with respect to 
the iteration number for results of Columns 1 
to 4 of Table 5-18. It can be observed from 
the results of Table 5-18 that, for the case in 
which all constraints were imposed, the opti¬ 


mum weight of the tower increased as more 
redundant members were removed from the 
structure. 

5-7.4 INTERACTIVE COMPUTING CON¬ 
CLUSIONS 

Computing times for this interactive com¬ 
puting approach are considerably shorter than 
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Figure &41, Iteration vs y\feight Curves for 
Example 5-8, Transmission 
Tower }Nith Stress Constraints 
Only 

had been experienced when the same prob¬ 
lems were solved in the batch mode. Second, 
and probably more significant, interactive 
computing allows the designer to alter the 
structural configuration in a systematic way 
to seek the global optimum design. This is not 
to say that a mathematically precise method 
of obtaining a global optimum has been 
found, for no such method is known. It 
appears, however, that the technique pre¬ 
sented here makes strong use of the designer’s 
knowledge and intuition, and gives him a tool 
with which to seek a global optimum in an 
organized way. 

The results presented for the two examples 
solved in par. 5-7 are of interest in their own 



Figure 5-42. iteration vs Weight Curves for 

Example 5-8, Transmission Tower 
With AH Constraints 


right. For the case when only stress con¬ 
straints are imposed, results of Table 5-15 
indicate that minimum weight designs for 
trusses with multiple loading may be statically 
determinate. However, the results of the 
second example given in Table 5-17 indicate 
that all statically determinate trusses may not 
be lighter than the indeterminate trusses. 

For the case when all constraints are 
imposed, results of Tables 5-15 and 5-18 show 
that statically indeterminate trusses are lighter 
than the determinate trusses. 
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CHAPTER 6 

THE CALCULUS OF VARIATIONS AND OPTIMAL PROCESS THEORY 


6-1 INTRODUCTION 

The problems of Chapters 2 through 5 arc 
all optimal design problems in which the 
design variables were elements of /?", i.e., a 
vector of n real numbers uniquely specified 
the design of the system being investigated. In 
many important, real-world, optimal design 
problems the design of a system cannot be 
specified so easily. For example, the thrust 
vector acting on a rocket during takeoff must 
be continuously oriented in time so that the 
rocket remains stable and follows a certain 
path. In this example, the angles the thrust 
vector makes with the rocket must be speci¬ 
fied at each instant of time during takeoff. It 
is clear that a function specifies the thrust 
direction rather than a finite number of 
parameters. 

Examples of this kind of problem abound 
in the aircraft guidance literature and in the 
optimal control literature. Typical design or 
control variables in these problems are thrust, 
motor torque, control surface settings, etc. 
All these variables must be specified through¬ 
out the entire interval of time an aircraft is in 
the air. Similar problems arise in the presently 
developing field of optimal structural design. 
In this field the design variables are generally 
variables that describe the distribution of 
material in structural elements. 

In order to illustrate the kind of problem 
to be treated in this chapter, two classic 
examples will be given. 


Example 6-1: The shortest path between 
two points, and in the t-x 

plane is to be found. As shown in Fig. 6-1, the 
particular path chosen between the two 
points has a length associated with it. The 
problem is to choose the curve jc(r), r® < r < 
which has the shortest length. For a 
smooth curve x{t) the length is given by 

Note that in this example the quantity J(x) 
to be minimized is a real number once the 
function x(/), r® < r < /Ms chosen. In this 
sense Jix) is a real valued function of a 
function or curve. 

Example 6-2: (The Brachistochrone): Giv¬ 
en two points (/®,x®) and (r* ) in a vertical 

plane that do not lie on the same vertical line, 
find a curve x(r), r® < r < rejoining them so 
that a particle starting at rest will traverse the 
curve without friction from one point to the 



Figure 6-1. Shortest Path 
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Other in the shortest possible time. Candidate 
curves are shown in Fig. 6-2. 



Figure 5-2. Curve for Minimum Time 


Let m be the mass of the particle and g be 
the acceleration due to gravity. Since the 
particle starts at rest at ) and there is no 
friction, 

= mg{x - X®) (6-2) 

where v is velocity, 



where r is time. Solving Eq. 6-2 for v, 
substituting this into Eq. 6-3 and solving for 
dr yields 



[2g(x - 


The total time T required for the particle to 
remove from (/®,x®) to (/^x^) is then 

6-2 



(6-4) 

This notation makes it clear that T depends 
on the entire curve transversed by the parti¬ 
cle. The Brachistochrone problem, therefore, 
is reduced to finding a curve x(r), r® < r < r*, 
that passes through the two given points and 
makes T as small as possible. 

In Examples 6-1 and 6-2 it is clear that a 
curve, or equivalently a function character¬ 
izing the curve, is to be found as the solution 
of the optimization problem. Further, the real 
valued quantities to be minimized are deter¬ 
mined by curves or the functions character¬ 
izing those curves. These real valued quanti¬ 
ties, therefore, are functions of functions. 
Such a real valued function is called a 
functional. The functional notation J{x) in 
Eqs. 6-1 and 6-4 is then interpreted as a real 
valued function of the function x(0.t® < t < 
r*. The most common kind of functional 
encountered in calculus of variations is the 
integral. 

The optimization problem considered here 
might be stated as: find the function x(0, f® 

< r < that minimizes the functional /(x). 
A glance at the functionals defined in Eqs. 6-1 
and 6-4 reveals a basic Haw in this statement 
of the optimization problem. In both cases, 
the functionals are defined only if the func¬ 
tion x(t) has an integrable derivative on r® < / 

< , i.e., it doesn’t make sense to admit all 

functions as candidates for an extremum. The 
problem is more reasonably stated: find the 
function x(r) r® < r < , in a class of 

functions D, that minimizes the functional 
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y(x). The admissible class of functions here 
plays a role similar to the constraint sets of 
Chapters 3,4, and 5. 

The idea of classes of functions required 
here is basic to the mathematical field called 
Functional Analysis. Classes of functions in 
this field are called function spaces. Consider, 
for example, the collection of all continuous 
functions x(r) on 0 < t < 1. The graphs of 
several such functions are shown in Fig. 6-3. 



Figure 6-3. Examples of Continuous Functions 

It is clear that there are infinitely many 
continuous functions but that not all func¬ 
tions are contained in this class. For example 


spaces may be described in a similar manner 
as 

C^(fl,b) = {x(t\a <t<h\ x(t) has / 

continuous derivatives |. (6-6) 

It should be understood here that x(t) may be 
a vector valued function and the differ¬ 
entiability requirement in Eq. 6-6 refers to 
each component. 

Function spaces may be thought of as sets 
of elements, where elements in the function 
space are reaUy curves or functions. In this 
way the problem of minimizing J(x) may be 
viewed as picking the element (curve) in the 
appropriate function space that makes J(x) as 
small as possible. This approach makes mini¬ 
mization of a functional sound very similar to 
the programming problems of Chapter 2. With 
this mental analogy one may begin his study 
of the calculus of variations armed with a 
powerful intuitive tool. 

The basic ideas of function space theory 
are presented very clearly in Ref. 1, Chapter 
2 . 


x(r) = 


0 , 

1 , 


0< r < 1/2 

1/2 < r < 1 


is not continuous so it is not in the class. 


To expedite the development that follows, 
some notation will be introduced. The collec¬ 
tion of continuous functions on 0 < t < 1 
described previously is called a function space 
and is denoted 


In connection with vector spaces, it is often 
necessary to require that a function is small, 
or near the zero function. For this purpose it 
is required that size of a function be defined. 
This is done by defining a norm as a function¬ 
al llxll on the function space of interest with 
the following properties: 

IIX II > 0, II X II = 0 implies x is the zero 

function (6-7) 


c«(o,i)= {x(r),o< 1 ua) 

is continuous } . (6-S) 


II ax II = II a II IIX II for real a (6-8) 

IU+>'||< llxll + lij'll. (6-9) 


A large number of important function 


Examples of norms include 


6-3 


Digitized by L^ooQle 



AMCP 706-192 


II jc II = max I x(r) | 
forxeC®(r®,/*)and 

ff'' T 

11x11= y x'^{t)dt 


( 6 - 10 ) 


( 6 - 11 ) 


for square integrable functions x(r). For a 
discussion of the basic ideas of functional 
analysis as they apply to optimization theory 
the reader is referred to Ref. 1. 


With the idea of norm defined, one can 
speak of relative minima of functionals. The 
functional J{x) has a relative minimum at 
JceD if there is a 6 > 0 such that 

m < J{x) 
for all xeD with 

llx - X II < 6 . (6-12) 

This simply says that J{x) has a minimum in a 
sufficiently small neighborhood of x. It is 
interesting to look at a neighborhood of a 
curve in C®(r®,rM where norm is defined by 
Eq. 6-10. In this case Eq. 6-12 simply 
demands that x(r) be within 6 of x(t) for all t 
in r® < r < The neighborhood of jc in this 
case is simply the collection of all continuous 
curves which can be drawn between jc(r) + 6 
and jc(/) — 6, as shown in Fig. 6-4. 

The present chapter will be devoted almost 
exclusively to the theory of the calculus of 
variations and optimal process theory. Con¬ 
structive methods for these problems will be 
treated in the chapters to follow. A knowl¬ 
edge of this basic theory is essential for 
successful application of the theory of opti¬ 
mal design. It has been the experience of the 
author that most real-world problems require 

6-4 



Figure &4, A Neighborhood of t) 


some modification of the basic optimization 
problems. Without a thorough knowledge of 
the theory, the designer will probably have no 
idea of how to modify the existing theory to 
suit his purposes. 

6-2 THE FUNDAMENTAL PROBLEM OF 
THE CALCULUS OF VARIATIONS 


Examples 6-1 and 6-2 have features in 
common that allow for the formulation of an 
entire class of problems containing these two. 
For the sake of generality, let the variable x(t) 
be a vector valued function of the real 
variable t, i.e.. 



(6-13) 


where x.{t) are real valued functions of t. 

The problem considered here may be for¬ 
mulated as Definition 6-1. 


Definition 6-7 (Fundamental Problem of 
the Calculus of Variations): Find a function 
x(t) in C^(r®,r‘) which satisfies 
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= xf, for some indices 1 < / < n 
Xj(t^ )-Xj , for some indices 1 < / < n 

(6-14) 


and which minimizes 

/(x)= [ F(t,x,x^)dt (6-15) 

Jr® 

where F is a real valued function of all its 
arguments and 


dt 

^ dt ^ 


(6-16) 


such candidate that must then be the solu¬ 
tion. If there are several candidates, other 
methods must be used to choose the solution. 
This problem will be discussed later. 

Graphically, the method of obtaining con¬ 
ditions on the solution, jc(r), of the funda¬ 
mental problem will be to allow small changes 
in jc(r) and examine the behavior of /(x). An 
admissible, small perturbation is illustrated in 
Fig. 6-5. The equation for this curve is jc(/) + 





x(t) 


Figure 6-5. Perturbation from Optimum 


If the reader wishes he may consider x(r) as 
being a real valued function of /, the general¬ 
ization to vector valued functions is simply a 
matter of notation. The conditions, Eq. 6-14, 
^>ecify some or all of the components of x(r) 
at the end points of the interval r® < r < r*. 
This corresponds to demanding that the 
curves in Examples 6-1 and 6-2 pass through 
given points. 

6-2.1 NECESSARY CONDITIONS FOR 
THE FUNDAMENTAL PROBLEM 

Only necessary conditions for solution of 
the fundamental problem of Def. 6-1 will be 
developed here, i.e., the existence of a solu¬ 
tion, jc(r), inC^(t®,t^) first will be assumed. A 
set of conditions that jc(r) must satisfy then 
will be derived. These conditions then may be 
employed in particular problems to find 
functions x(t) that are candidate solutions of 
the problem. Hopefully, there will be just one 


erjit) where e is a small real number and T?(r) 
is any member of ) such that 

) = 0, for i with x^(/®) = \ 

} (6-17) 

T?y(r‘) = 0, for/ with Xy(rM = ^/ ) 


To examine the effect of this perturbation 
of /(x), substitute x + er? in Eq. 6-15, 

/(x + ei7) = f Fit, X + ei7. x' + er\*)dt. 

(6-18) 

Recall that x(r) is a local minimum of /(x) 
subject to Eq. 6-14, i.e., any small change in 
x(r) increases /(x). For any given function 
i7(/) in C^(r®,r') and satisfying Eq. 6-17, x(0 
+ €17(0 is in C^it^j^) and satisfies Eq. 6-14 

6-5 
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for all e. Therefore, for this 17 ( 0 , + € 17 ) is a 

real valued function of e. Further, for e = 0, 
J{x + ei 7 ) has a relative minimum and it is 
assumed that F(t,x,x*) is twice continuously 
differentiable in x and x' so that J{x + € 17 ) is a 
twice continuously differentiable function of 
e. Theorem 2-2 then applies, and it is required 
that 

^/(jr + eij) =0. (6-19) 

oe |e«0 

The object now is to transform condition, 
Eq. 6-19, into conditions on Performing 
the differentiation indicated in Eq. 6-19, 



where the arguments in the partial derivatives 
of / in Eq. 6-20 are jc(0 and x'(r). It is 
important to remember that Eq. 6-20 is 
required to hold for any 17(0 in 
which satisfies Eq. 6-17. 


Integrating the second term in the 
integrand of Eq. 6-20 yields 



dt 3x'/ 


ridt 


), )] ‘) ( 6 - 21 ) 




be treated independently, i.e., each is required 
to be zero. One of the nuyor results which 
follows is a direct application of Lemma 6-1. 

Lemma 6-1: If M{t) is a continuous func¬ 
tion on r® < r < r* and if 

V' 

I Af (0 n (0 dt = 0 ( 6 - 22 ) 

for all 17(0 in (r®, ) with ) = i/ff*) = 0 , 

then ^(0 = 0 , r® < r < 

The ideas involved in the proof are easily 
seen graphically. In Fig. 6-6 a point f*, f® < 



r* < is shown where MJit *) ^ 0. The curve 
is then constructed so that neither 
function is zero in the interval a < t < b. 
Their integral over the entire interval is then 
nonzero which is a contradiction of Eq. 6-22, 
so = 0 . 

Since the two terms in Eq. 6-21 must each 
be zero, 

j ^0 


Since the behavior of 17 inside the interval r® 
< r < r* and at its ends are independent, the 
integral and boundary terms in Eq. 6-21 may 

6-6 


for all 77(0 in ). In any subinterval of 

r® < r < where i(r) is continuously 
differentiable, the quantity 
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[dF/dx - d/dt {dF/3x')] is continuous. 
Therefore Lemma 6-1 implies 


3x dt\hxj 


(6-24) 


in that subinterval. 


If, however, jc'(0 has a jump discontinuity 
at some point i then [dF/dx - djdt (3F/dx')] 
need not be continuous at i and Lemma 6-1 
may not be applied over any subinterval 
containing i. Since £q. 6-24 must hold in 
subintervals on both sides of f, this equation 
may be integrated from f - 6, 6 > 0, to r to 
obtain 


3F 

dx' 


c. 


— dt + C 
3x 


(6-25) 


The vector 3F/3x is piecewise continuous so 
the right-hand side of Eq. 6-25 is continuous. 
Therefore 3F/3x' is continuous even at i. 


These results may be stated in the form of 
a theorem. 


Theorem 6-1: The following conditions 
must be satisfied by the solution of the 
problem of Def. 6-1, x(r), whose derivative is 
piecewise continuous; 


3F 

— [r,i(r),x'(r)] 
ox 

at points of continuity of x(r) 

3F 

— [r>.x(r>).W)lr?(r‘) 

3x 

- [r°.i(r°).x (r®)Jr?(r®) = 0 (6-27) 

3x 


for all i7(f®), i7(r*) satisfying Eq. 6-14, and 

3F 

— [f-0 ,x(f-0 ),x'(f-0)] = 

3x 

^ [f+ 0,x(f+ 0),x'(f+0)] (6.28) 

3x' 

at each point i of discontinuity of x\t). 

Condition, Eq. 6-26, is a second-order 
differential equation in x(r) and is called the 
Euler-Lagrange equation. Condition, Eq. 6-27, 
is called a transversality condition. For each i 
or / such that i7^(/®) or r]y(r*) is not specified 
by Eq. 6-14, Eq. 6-27 implies 3F/3x; (r®) = 0 
or 3F/3xy' (rM = 0. The condition, Eq. 6-28, 
at discontinuities (called corners) in x'(r) is 
called the Weierstrass-Erdmann comer condi¬ 
tion. 

One further necessary condition will be 
important for further development. Define 
the Weierstrass E-function as 

£(r,x,x',w) = F(r,x,w) -F(r,x,x') 

(6-29) 

3F 

,(r.x.x')(w-x'). 

dx 

The proof of the Weierstrass necessary condi¬ 
tion may be found in Ref. 2, page 149. The 
result only will be given here as Theorem 6-2. 

Theorem 6-2: If the function x(r) is the 
solution of the problem of Def. 6-1, then it is 
necessary that 

F(f,x(0,i'(0. vv] > 0 (6-30) 

for all r® < / < r* and all finite w. 

The Weierstrass condition of Theorem 6-2 
generally is not used to generate candidate 
solutions of the fundamental problem. 

6-7 
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Rather, when solutions of Eq. 6-26 are 
determined, Eq. 6-30 is used to eliminate 
unsuitable functions, i.e., it may very well 
disqualify a function which satisfies Eq. 6-26. 

The derivation of necessary conditions for 
the fundamental problem is only very lightly 
covered here. Further, the theory of sufficient 
conditions is completely neglected. For out¬ 
standing and complete treatments of these 
topics see Refs. 2, 3,4, and 5. 

6-2.2 SPECIAL CASES AND EXAMPLES 


This is simply an algebraic equation be¬ 
tween t and X. Since there will be no 
constants of integration, it will not generally 
be possible to pass the resulting curve through 
particular points. This means that a solution 
to such a problem generally will not exist. 

Example 6-3: Minimize 
1 

x^dt 

0 
for 


In many problems the form of the function 
P(t,xy) allows for simplification for the 
Euler-Lagrange equation, Eq. 6-26. In any 
case, Eq. 6-26 may be written, using the chain 
rule of differentiation and the notation 


x(0) = 0, x(l)=l. 

The condition, Eq. 6-33, is 


2x = 0. 


3F 


dF 






dtbx’ 


and 


But it is, therefore, impossible to satisfy x(l) 
= 1 so the problem has no solution. 


F , = 

XX 


bx'bx 


to obtain 


Px-Px; 


x'^ F^. 

^ * XX 




To get an idea of what has gone wrong, 
note that since x^{t) > 0 for each r, 


1 



(t)dt > 0 


= 0. (6-31) 

This is simply a second-order differential 
equation for x{t). 

Several special cases with examples will 
now be considered. 

Case 1 . F does not depend on x': 

F = F(r.x). (6-32) 

Eq. 6-31 in this case is 

F^(r.x) = 0. (6-33) 

6-8 


for any curve on 0 < r < 1. It is, therefore, 
clear that if there were a curve which mini¬ 
mized J^x^dt, then the minimum value of 
the integral would be non-negative. 

It was noted that no minimum exists. 
However, consider the family of curves 

These curves all satisfy the end conditions and 

^ o 
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the design of the system being investigated. In 
many important, real-world, optimal design 
problems the design of a system cannot be 
specified so easily. For example, the thrust 
vector acting on a rocket during takeoff must 
be continuously oriented in time so that the 
rocket remains stable and follows a certain 
path. In this example, the angles the thrust 
vector makes with the rocket must be speci¬ 
fied at each instant of time during takeoff. It 
is clear that a function specifies the thrust 
direction rather than a finite number of 
parameters. 

Examples of this kind of problem abound 
in the aircraft guidance literature and in the 
optimal control literature. Typical design or 
control variables in these problems are thrust, 
motor torque, control surface settings, etc. 
All these variables must be specified through¬ 
out the entire interval of time an aircraft is in 
the air. Similar problems arise in the presently 
developing field of optimal structural design. 
In this field the design variables are generally 
variables that describe the distribution of 
material in structural elements. 

In order to illustrate the kind of problem 
to be treated in this chapter, two classic 
examples will be given. 


Example 6-1: The shortest path between 
two points, and in the t-x 

plane is to be found. As shown in Fig. 6-1, the 
particular path chosen between the two 
points has a length associated with it. The 
problem is to choose the curve Jc(r), r® < r < 
r* which has the shortest length. For a 
smooth curve x(r)the length is given by 

1/2 

dt (6-1) 

Note that in this example the quantity J(x) 
to be minimized is a real number once the 
function x(r), r® < r < r* is chosen. In this 
sense /(x) is a real valued function of a 
function or curve. 

Example 6-2: (The Brachistochrone): Giv¬ 
en two points (/®,x®) and (r* ,x^) in a vertical 
plane that do not lie on the same vertical line, 
find a curve x(r), < t < joining them so 

that a particle starting at rest will traverse the 
curve without friction from one point to the 



Figure &1. Shortest Path 
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Other in the shortest possible time. Candidate 
curves are shown in Fig. 6-2. 



Figure &2. Curve for Minimum Time 


Let m be the mass of the particle and g be 
the acceleration due to gravity. Since the 
particle starts at rest at and there is no 

friction, 

■jmv^ =mg(x-x'>) (6-2) 

where v is velocity, 



where r is time. Solving Eq. 6-2 for v, 
substituting this into Eq. 6-3 and solving for 
dr yields 



I2g(x - jc®)] 


The total time T required for the particle to 
remove from to is then 

6-2 



(6-4) 

This notation makes it clear that T depends 
on the entire curve transversed by the parti¬ 
cle. The Brachistochrone problem, therefore, 
is reduced to finding a curve jc(r), r® < r < r*, 
that passes through the two given points and 
makes T as small as possible. 

In Examples 6-1 and 6-2 it is clear that a 
curve, or equivalently a function character¬ 
izing the curve, is to be found as the solution 
of the optimization problem. Further, the real 
valued quantities to be minimized are deter¬ 
mined by curves or the functions character¬ 
izing those curves. These real valued quanti¬ 
ties, therefore, are functions of functions. 
Such a real valued function is called a 
functional. The functional notation J(x) in 
Eqs. 6-1 and 6-4 is then interpreted as a real 
valued function of the function x(t)j^ < t < 
r^. The most common kind of functional 
encountered in calculus of variations is the 
integral. 

The optimization problem considered here 
might be stated as: find the function x(r), r® 

< t < 0 , that minimizes the functional J{x). 
A glance at the functionals defined in Eqs. 6-1 
and 6-4 reveals a basic Haw in this statement 
of the optimization problem. In both cases, 
the functionals are defined only if the func¬ 
tion x(t) has an integrable derivative ont^ < t 

< 0 , i.e., it doesn’t make sense to admit all 
functions as candidates for an extremum. The 
problem is more reasonably stated: find the 
function jc(r) r® < t < in a class of 
functions D, that minimizes the functional 
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J(x). The admissible class of functions here 
plays a role similar to the constraint sets of 
Chapters 3,4, and 5. 

The idea of classes of functions required 
here is basic to the mathematical field called 
Functional Analysis. Classes of functions in 
this field are called function spaces. Consider, 
for example, the collection of all continuous 
functions jc(r) on 0 < t < 1. The graphs of 
several such functions are shown in Fig. 6-3. 



Figure 6-3, Examples of Continuous Functions 


It is clear that there are infinitely many 
continuous functions but that not all func¬ 
tions are contained in this class. For example 


x(r) = 


0 , 

1 , 


0< r < 1/2 
1/2 < r < 1 


is not continuous so it is not in the class. 


To expedite the development that follows, 
some notation will be introduced. The collec¬ 
tion of continuous functions on 0 < / < 1 
described previously is called a function space 
and is denoted 

C»(0,1)= {x(O.0< 1 U(r) 

is continuous } . (6-S) 

A large number of important function 


spaces may be described in a similar manner 
as 


C\a,b) = {x(r), a< t < b\ x(t) has i 

continuous derivatives |. (6-6) 

It should be understood here that jc(r) may be 
a vector valued function and the differ¬ 
entiability requirement in Eq. 6-6 refers to 
each component. 

Function spaces may be thought of as sets 
of elements, where elements in the function 
space are really curves or functions. In this 
way the problem of minimizing Jix) may be 
viewed as picking the element (curve) in the 
appropriate function space that makes Jix) as 
small as possible. This approach makes mini¬ 
mization of a functional sound very similar to 
the progranuning problems of Chapter 2. With 
this mental analogy one may begin his study 
of the calculus of variations armed with a 
powerful intuitive tool. 

The basic ideas of function space theory 
are presented very clearly in Ref. 1, Chapter 
2 . 


In connection with vector spaces, it is often 
necessary to require that a function is smaU, 
or near the zero function. For this purpose it 
is required that size of a function be defined. 
This is done by defining a norm as a function¬ 
al ||x|| on the function space of interest with 
the following properties: 


IIX II > 0, IIX II = 0 implies x is the zero 


function 

(6-7) 

II ax II = II a II II X II for real a 

(6-8) 

lU+y ||< ||x|| + ||y II. 

(6-9) 

Examples of norms include 



6-3 

Digitized by 

Googl 



AMCP 706-192 


lU II = max 1 xU) | (6-10) 

forxG C®(r®, r*) and 

” 1/2 

xHOdt (6-11) 

for square integrable functions x(/). For a 
discussion of the basic ideas of functional 
analysis as they apply to optimization theory 
the reader is referred to Ref. 1. 

With the idea of norm defined, one can 
speak of relative minima of functionals. The 
functional Jix) has a relative minimum at 
jcgZ) if there is a 6 > 0 such that 

m < J(x) 
for all xeD with 

IIJc -X II < 6 . (6-12) 

This simply says that J(x) has a minimum in a 
sufficiently small neighborhood of x. It is 
interesting to look at a neighborhood of a 
curve in C®(r®,r') where norm is defined by 
Eq. 6-10. In this case Eq. 6-12 simply 
demands that x(t) be within 6 of xit) for all t 
in r® < r < The neighborhood of x in this 
case is simply the collection of all continuous 
curves which can be drawn between Jc(r) + 6 
and jc(r) — 6, as shown in Fig. 6-4. 

The present chapter will be devoted almost 
exclusively to the theory of the calculus of 
variations and optimal process theory. Con¬ 
structive methods for these problems will be 
treated in the chapters to follow. A knowl¬ 
edge of this basic theory is essential for 
successful application of the theory of opti¬ 
mal design. It has been the experience of the 
author that most real-world problems require 

6A 



Figure &4, A Neightxyrhood of x(t) 


some modification of the basic optimization 
problems. Without a thorough knowledge of 
the theory, the designer will probably have no 
idea of how to modify the existing theory to 
suit his purposes. 

6-2 THE FUNDAMENTAL PROBLEM OF 
THE CALCULUS OF VARIATIONS 

Examples 6-1 and 6-2 have features in 
common that allow for the formulation of an 
entire class of problems containing these two. 
For the sake of generality, let the variable x(r) 
be a vector valued function of the real 
variable t, i.e., 

x(0= (6-13) 

where x^.(r) are real valued functions of t. 

The problem considered here may be for¬ 
mulated as Definition 6-1. 

Definition 6-7 (Fundamental Problem of 
the Calculus of Variations): Find a function 
xit) in C^(r®,r*) which satisfies 
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Xfit^) = for some indices \ < i< n 

Xj(t^ )-xj, for some indices 1 < / < /i 

(6-14) 


and which minimizes 

f 

y(x)= I F{t,x,x*)dt (6-15) 

Jr® 

where F is a real valued function of all its 
arguments and 


dr 

- dt - 


(6-16) 


such candidate that must then be the solu¬ 
tion. If there are several candidates, other 
methods must be used to choose the solution. 
This problem will be discussed later. 

Graphically, the method of obtaining con¬ 
ditions on the solution, jc(r), of the funda¬ 
mental problem will be to allow small changes 
in Jc(r) and examine the behavior of J(x). An 
admissible, small perturbation is illustrated in 
Fig. 6-5. The equation for this curve is jc(r) + 



If the reader wishes he may consider x(r) as 
being a real valued function of r, the general¬ 
ization to vector valued functions is simply a 
matter of notation. The conditions, Eq. 6-14, 
specify some or all of the components of x(r) 
at the end points of the interval r° < r < r^. 
This corresponds to demanding that the 
curves in Examples 6-1 and 6-2 pass through 
given points. 


6-2.1 NECESSARY CONDITIONS FOR 
THE FUNDAMENTAL PROBLEM 

Only necessary conditions for solution of 
the fundamental problem of Def. 6-1 will be 
developed here, i.e., the existence of a solu¬ 
tion, x(r), in ) first will be assumed. A 

set of conditions that x(t) must satisfy then 
will be derived. These conditions then may be 
employed in particular problems to find 
functions x(r) that are candidate solutions of 
the problem. Hopefully, there will be just one 


617(0 where e is a small real number and v(0 
is any member of ) such that 


) = 0, for / with x^(t ^) = x^° 

i7^(r‘) = 0, for / with x^(r') = x^' 


(6-17) 


To examine the effect of this perturbation 
of /(x), substitute x + ery in Eq. 6-15, 

J(x + 6T?) = I F(f, X + €17. x' + €r{)dt. 

^ f® 

(6-18) 

Recall that x{t) is a local minimum of J{x) 
subject to Eq. 6-14, i.e., any small change in 
x(r) increases /(x). For any given function 
17(0 in C^(r°,f') and satisfying Eq. 6-17, x(0 
+ €17(0 is in C^(f®,f‘/ and satisfies Eq. 6-14 
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for all €. Therefore, for this t}(0, J(Sc + €17) is a 
real valued function of e. Further, for e = 0, 
y(jc + er?) has a relative minimum and it is 
assumed that F(t,x,x*) is twice continuously 
differentiable in x and jc' so that J(x + C17) is a 
twice continuously differentiable function of 
6. Theorem 2-2 then applies, and it is required 
that 

^/(Jc+eij) =0. (6-19) 

de ie*o 


The object now is to transform condition, 
Eq. 6-19, into conditions on Jc(r). Performing 
the differentiation indicated in Eq. 6-19, 



where the arguments in the partial derivatives 
of / in Eq. 6-20 are jc(r) and Jc'(f). It is 
important to remember that Eq. 6-20 is 
required to hold for any iKO in 
which satisfies Eq. 6-17. 


Integrating the second term in the 
integrand of Eq. 6-20 yields 



dt 


ridt 


+1^ [r',i(r').W)]T?(r') (6-21) 




be treated independently, i.e., each is required 
to be zero. One of the m^or results which 
follows is a direct application of Lemma 6-1. 

Lemma 6-7; If M(t) is a continuous func¬ 
tion ont^ < t < and if 

( M{t) r?(f) dr = 0 (6-22) 

for all 17(0 in f*) with iKf®) = = 

thenAf(r) = 0, < t < . 

The ideas involved in the proof are easily 
seen graphically. In Fig. 6-6 a point r*, r® < 



r* < , is shown where Af^(r*) ^ 0. The curve 

T7^(r) is then constructed so that neither 
function is zero in the interval a < t < b. 
Their integral over the entire interval is then 
nonzero which is a contradiction of Eq. 6-22, 
so = 0. 

Since the two terms in Eq. 6-21 must each 
be zero, 

J fO 


Since the behavior of 17 inside the interval r® 
< t < and at its ends are independent, the 
integral and boundary terms in Eq. 6-21 may 

6-6 


for all rj(t) in ). In any subinterval of 

< r < where x(t) is continuously 
differentiable, the quantity 
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[dF/3jc - didt (3F/3x')] is continuous. 
Therefore Lemma 6-1 implies 


djc 



(6-24) 


in that subinterval. 


If, however, i'(f) has a jump discontinuity 
at some point f then [3F/3x - djdt (3F/3x')l 
need not be continuous at i and Lemma 6-1 
may not be applied over any subinterval 
containing i. Since Eq. 6-24 must hold in 
subintervals on both sides of f, this equation 
may be integrated from f - 6, 6 > 0, to r to 
obtain 


3x' 



— dr + C. 
dx 


(6-25) 


The vector dF/dx is piecewise continuous so 
the right-hand side of Eq. 6-25 is continuous. 
Therefore bFIbx* is continuous even at f. 


These results may be stated in the form of 
a theorem. 


Theorem 6-1: The following conditions 
must be satisfied by the solution of the 
problem of Def. 6-1, i(r), whose derivative is 
piecewise continuous; 

3F 

— [r,i(r),i'(r)] 

= 0 (6-26) 

at points of continuity of x(t) 

3F 

bx 

- (r»,x(r®).x'(r®))t?(r®) = 0 (6-27) 


dF 


),i'(f)l I 


for all T/(r®), »}(f*) satisfying Eq. 6-14, and 
bF 

— [f-0,i(f-0),i'(f-0)] = 


^ [f + 0,i(f + 0),jc'(f + 0)1 (6-28) 

3x' 

at each point i of discontinuity of i'(r). 

Condition, Eq. 6-26, is a second-order 
differential equation in i(r) and is called the 
Euler-Lagrange equation. Condition, Eq. 6-27, 
is called a transversality condition. For each i 
or / such that or i 7 ^(r*) is not specified 
by Eq. 6-14, Eq. 6-27 implies bFjbx] (r®) = 0 
or 3F/3 jcJ (rM = 0. The condition, Eq. 6-28, 
at discontinuities (called corners) in x\t) is 
called the Weierstrass-Erdmann comer condi¬ 
tion. 

One further necessary condition will be 
important for further development. Define 
the Weierstrass E-function as 

£(/, jc, x\w)- Fit, X, w) - Fit, X, x') 

(6-29) 

bF 

-—,it,x,x')iw-x'), 

bx 

The proof of the Weierstrass necessary condi¬ 
tion may be found in Ref. 2, page 149. The 
result only will be given here as Theorem 6-2. 

Theorem 6-2: If the function i(f) is the 
solution of the problem of Def. 6-1, then it is 
necessary that 

£[r,Jc(r),jc'(f). w] > 0 (6-30) 

for all f® < r and all finite w. 

The Weierstrass condition of Theorem 6-2 
generally is not used to generate candidate 
solutions of the fundamental problem. 

6-7 
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Rather, when solutions of Eq. 6-26 are 
determined, Eq. 6-30 is used to eliminate 
unsuitable functions, i.e., it may very well 
disqualify a function which satisfies Eq. 6-26. 

The derivation of necessary conditions for 
the fundamental problem is only very lightly 
covered here. Further, the theory of sufficient 
conditions is completely neglected. For out¬ 
standing and complete treatments of these 
topics see Refs. 2, 3, 4, and 5. 

6-2.2 SPECIAL CASES AND EXAMPLES 


This is simply an algebraic equation be¬ 
tween t and X, Since there will be no 
constants of integration, it will not generally 
be possible to pass the resulting curve through 
particular points. This means that a solution 
to such a problem generally will not exist. 

Example 6~3: Minimize 
1 

x^dt 

0 
for 


In many problems the form of the function 
ftt,x,x*) allows for simplification for the 
Euler-Lagrange equation, Eq. 6-26. In any 
case, Eq. 6-26 may be written, using the chain 
rule of differentiation and the notation 


x(0) = 0, jc(l)=l. 

The condition, Eq. 6-33, is 

2x = 0. 




But it is, therefore, impossible to satisfy x(l) 
= 1 so the problem has no solution. 




dx'dx 


to obtain 


F -F . - x"’’’F . ■ 

X X t XX XX 


To get an idea of what has gone wrong, 
note that since x^(t) > 0 for each /, 


(t)dt > 0 

0 


= 0. (6-31) 

This is simply a second-order differential 
equation for x(r). 

Several special cases with examples will 
now be considered. 

Case 1 . F does not depend on x': 

F = F(r,x). (6-32) 


for any curve on 0 < t < 1. It is, therefore, 
clear that if there were a curve which mini¬ 
mized J^x^dt, then the minimum value of 
the integral would be non-negative. 

It was noted that no minimum exists. 
However, consider the family of curves 

(0 = r". 

These curves all satisfy the end conditions and 


Eq. 6-31 in this case is 

F,(r,x) = 0. 

6-8 


(6-33) 





t^^dt 


1 

2 « + 1 ■ 
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Therefore, it is possible to choose n large 

enough so that / x^dt as close as desired 
Jo ” 

to zero. However, the limit of x^(0 as n 
approaches infinity is the function 

( 0, r < 1 

( 1 , ^= 1 . 

and this is not even a continuous function. 
The class of functions (t) are illustrated in 
Fig. 6-7. 



Figure 6-7. Minimizing Sequence 


The function F from Eq. 6-1 is 
F=(1+(x')M''\ 

The form of the Euler-Lagrange equation in 
Eq. 6-35 applies in this case to yield 

- [1 +(Jc')M‘ x" = 0. 

Since > 0, [1 + (jc')^] 0 and x'(0 is 

required to be continuous so [ 1 + ] # <» 

and it, therefore, is required that 

x"(/) = 0 

or 

xit) = at + by 

where a and b are constants. This implies that 
the shortest path between two points in a 
plane is a straight line. This shouldn’t shake 
anyone up. 


In this illustration, a solution of the prob¬ 
lem exists in the class of piecewise continuous 
functions but not in the class of twice 
continuously differentiable functions. This 
problem, therefore, should serve as a warning 
that not all innocent looking calculus of 
variations problems have solutions. 

Case 2. F depends only on x': 

F = F(x'). (6-34) 

Eq. 6-31 is in this case 

F,,^,x" = 0. (6-35) 

Example 6-4: Using the formulation of 
Example 6-1, find the shortest curve in the t-x 
plane which passes through the points (0,0) 
and (1,1). 


The end conditions yield 
x(0) = b = 0 
and 

x(l)=a=l. 

Therefore the solution of the problem is 
x(t) = t. 

Case 3. F depends only on t and x': 

F = Fr/.x7 (6-36) 

Eq. 6-26 is, in this case. 
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or 


This reduces to 




(6-37) 1={ 2pc(1+(x'>»M) ''*C, 


where C is an arbitrary constant. 


or 


Case 4. x is a real valued function, and F 
depends only on x and x': 

F-F(x,x*). (6-38) 

Eq. 6-31 is, in this case. 


x[l+rxVM =Ci 

where C, is a new constant. 

The solution of this differential equation is 
a family of cycloids in parametric form 


F^ - F^.^ X - F ^.^, x" - 0. ^ ^ (5 - sin 5 > 

Multiplying by x' yields 

and 


This is just 


—(F-x'F J = 0, 
at 


Co 

X = — (1 — cos s), 

2 

The constants Ci and C 2 are to be deter¬ 
mined so that the cycloid which passes 
through the given points is fixed. 


so 

F-x'F^.^C (6-39) 

where C is an arbitrary constant. 

Example 6-5: Solve the Brachistochrone 
problem of Example 6-2. 

The function F from Eq. 6-4 is 



Eq. 6-39 applies in this case and yields 

(x'p _ 

2gx J (2gx)''^ [l+(x'P]''^ 

= C. 

6-10 


It should be noted that each of the 
problems treated here reduced to the solution 
of a nonlinear differential equation. This is 
characteristic of problems of the calculus of 
variations. The reader is undoubtedly aware 
that it is only in the simplest cases that closed 
form solutions of these differential equations 
may be obtained. Further, questions of exis¬ 
tence and uniqueness of solutions are by no 
means trivial. 

6-2.3 VARIATIONAL NOTATION AND 
SECOND-ORDER CONDITIONS 

FotJ(x)^ \ F(t,x.x')dt. 

J ^0 

define the first variation of J(x) as 

bJ(x) =-^/(x + €dx)\ (6-40) 

de |e«o 


Digitized by L^ooQle 



AMCP 706-192 


de]^ 


F(t.x + e6x,x' 


+ €dx')dt\ 


+ j dt. 


Note that all this does not require that x(/) 
be the solution of the fundamental problem. 
If, however, x(t) = x(t) is the solution of the 
fundamental problem, then it is clear from 
Eq. 6-19 that it is necessary that 


6J(x) = 0, 


(6A\) 


for all dx(t) for which if + e6jcf t) satisfy the 
end conditions in the fundamental problem. 

In a way quite similar to the definition of 
the first variation, the second variation may 
be defined as 


b^J(x)^—J(x^€bx) 

de^ e«o 


Performing the differentiation, this is 


b^J(x) 


-11 


F(t,x + €6jc,x' + e6jcV dt\ 


(t,x + ebx,x* + ebx*) 


+ bx'—T + ebxy + ebx) dt 

bx* e-0 




+ 26 x-^, (t,xy)bx^ 


^b^F 1 

bx' ' ^^,2 (f*xy)bx' i 


Define 



With this notation. 


b^J(x) = 1 (6jc^i46x + ojs,^Bbx* 


+ 6x'^C6x') dt. 

If F(t,xy) has three derivatives, then by 
Taylor’s formula 


J(x + ebx) = J(x) + — 
de 

l€«0 

d^J , dV , 

de^ ^ ^ de^ ^ . 
€«o e«€ 


(6-42) 


where 0 < e < e. If we computed d^J/de^, it 
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would involve a sum of terms each containing 
third degree terms in 6 jc and 6 jc'. For 6x and 
sufficiently small, this term may be 
neglected to obtain a second-order approxi¬ 
mation when e = 1 so that 

It is clear then that 6/ and play the 
role of differentials in the theory of func¬ 
tionals. 

Further, if x(t) yields a relative minimum 
for the fundamental problem, then J(x + edx) 
is a relative minimum at € = 0. It is, therefore, 
necessary that 

> 0 . 

€=0 


£J 

de^ 


This is just 
d^J(x) > 0, 


or 




(dx^Adx + dx^Bdx* 


+ 6x'^C6x')dr> 0 (6-43) 


for all dx(t) such that jc + 6jc satisfy the end 
conditions for the fundamental problem. In 
what follows it will be convenient to limit 
bx( t) to those variations which satisfy bx(t^) 
= 6xrf‘>/ = 0. 


If bx*(t) is small for all r, then bx(t) must 
also be small since bx(t^) = 0. On the other 
hand, it is possible to choose bx(t) which is 
zero at the endpoints and small for aU t, but 

6-12 


which has large derivatives. One might, there¬ 
fore, be led to believe that the derivative term 
in the inequality of Eq. 6-43 is dominant. 
This would then require C to be positive 
semi-definite. 

To show that this is the case, assume that 
there is a point t*, f® < f < f‘ and a nonzero 
vector h such that h^C(t*)h = - 2/3 < 0. For 
any continuous bx^(t) such that bx*(t*) = h, 
there is an interval 

t* -a< t < + a > 0, such that 

bx*(tFat)bx*(t) < - j3 < 0 

in - a < t < + 

Define 




h sin 



f*+a 


6x(f) = \ 


y 0, elsewhere 


so that 


6x'(r) = 


Ih cos I-—-—J , f* -a< 


t< f + a 


0, elsewhere 


Now, Eq. 6-43 is 

{dx'^Adx + 6x^B6x' + 6x'^C6x') dt 




■/'"■Is-m- 
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where 


Af=®;* \h’^A{t)h\ 
/>=«n«x \h’^B{t)h\ 


incorrect. This implies hTCh > 0 for all h and 
Therefore C(t) is positive semi-definite. 
Since 


C(i) 


b^F 


this result may be stated as Theorem 6-3. 

Theorem 6-3: A necessary condition for 
the fundamental problem to have a relative 
minimum at ^t) is that 


b^F 

bx^^ 


ltMt).x^(t)] 


be positive semi-definite for all < r < f*. 

Gelfand and Fomin (Ref. 2, p. 104) indi¬ 
cate that people are prone to argue that 
positive definiteness of b^Flbx*^ at each 
point of the solution is a sufficient condition 
for an extremum. They point out, however, 
that this is not the case and, in fact, that no 
local condition can provide sufficient condi¬ 
tions. For a treatment of sufficient conditions 
sec Refs. 2, 3,4, and 5. 


Therefore, integration in the preceding in¬ 
equality yields 

However, since a may be chosen arbitrarily 
small and jS > 0, the right side will be negative 
for sufficiently small a. But this is a con¬ 
tradiction. Therefore, the assumption that 
there exist t* and h such that hTCh < 0 is 


6-2.4 DIRECT METHODS 

The direct methods of the calculus of 
variations seek to generate a sequence of 
functions [x^^^ ft)] such that, if { is the 
infimum of J(x) over all admissible x, then 

(6-44) 

/!-►<» 

Direct methods are capable of showing 
existence of solution as well as construction 
of approximations of the solution. It is 
generaUy very difficult to prove existence of a 
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solution of the nonlinear boundary-value 
problem in the necessary conditions for the 
fundamental problem. It is often possible, 
however, to show that the sequence (t)] 

converges to a function ^t) which is the 
solution of the fundamental problem, i.e.. 


where are constants. Classical, trigo¬ 
nometric Fourier series is an example of this 
kind of representation. 

In the Ritz Method, the nth function in the 
minimizing sequence is formed by 


lim 




(6^5) 


n 


/-I ' * 


( 6 ^ 6 ) 


It is clear, however, that a sequence which 
satisfies Eq. 6-44 may very well fail to 
converge to an admissible function if/A This 
must necessarily be the case if no solution of 
the fundamental problem exists. From an 
engineering point of view, one may not be too 
concerned with existence of a limit of the 
sequence (t)]. Provided it is possible to 
successively reduce J, consistently better re¬ 
sults are being obtained and the process will 
be continued until no further meaningful 
reduction in J may be achieved. For an 
outstanding treatment of convergence of di¬ 
rect methods, see Ref. 2, page 192, and Ref. 
3, page 127. 

The problem of primary interest to the 
engineer is the construction of a minimizing 
sequence. There are many ways of generating 
such a sequence, only two of which will be 
treated here. These methods are known as the 
Ritz Method and the Method of Finite Differ¬ 
ences. 

6-2.4.1 THE RITZ METHOD 

The Ritz Method is based on the idea of 
representing functions by using linear com¬ 
binations of known functions; i.e., given 0/rA 
I = 1,2,... which preferably form a complete 
set, a function is represented by 

y(t)- X aAkXt) 


where the 4^/0 are chosen so that 
satisfies the end conditions associated with 
the fundamental problem. This expression k 
then substituted into J(x) to obtain 

j[x(»^] = 


2 2 ( 647 ) 

The object now is to choose the coefficients 
Qf, 1=1, ...» n, so that /[x^'*^] is as small as 
possible. For this purpose, it should be noted 
that the right side of Eq. 6-47 is simply a 
function of n parameters. The problem is now 
to minimize this function without any other 
constraints. For this purpose, any of the 
methods of Chapter 2 may be used. 

The property 

follows readily from the method of det^- 
mining the a^. It is clear that by choosing 
- 0, (t) = xf"Ht). However, by 

allowing ci„^i to be nonzero, a larger number 
of functions are available as candidates for 
minimum of than The 

minimum of will, therefore, cer¬ 

tainly not be greater than that of /[x^"^ J and 
this is the desired result. 
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In practice, the rate of convergence of 
lx^"^(t)] depends strongly on the functions 
^/t) chosen. The number of terms required to 
obtain a reasonable approximation of the 
solution is greatly reduced if these functions 
are chosen based on a reasonable engineering 
estimate of the form of the solution. By 
making a judicious choice of the ^/t), a good 
approximation of the solution may be ob¬ 
tained with as few as two or three terms. 


This is a positive definite quadratic form, so it 
has a unique minimum which may be ob¬ 
tained by setting its first derivatives equal to 
zero. This yields 


38 20 

105 ' 



0 


20 2488 

45 ^ 3465 





Example 6-6: In solving the boundary- 
value problem 

x" + (1 + r*)x + 1 = 0 

x(- 1) = x(l) = 0. 

it is necessary to minimize the functional 

J(x) - I [x'* - (1 + r^)x* - 2x1 dt 
^ — 1 

( 6 - 18 ) 

subject to the end conditions x(-1) = x( 1) = 

0 . 


The solution of these equations is 
a, = 0.9877 
a, = - 0.05433. 

Substituting these coefficients into Eq. 
6-49, 

x<*>fr7 =-^^(3969 - 4200r* + 23ir*). 

In particular, 
x<*>{0) = 0.93344. 


In order to minimize J(x) of Eq. 6-48, by 
the Ritz Method, choose 


= (1 - r*'). 

If for a first approximation n = 2 is chosen, 

x<*>fr>=a,(l-x*) + a,(l-X*). (6^9) 

Substituting x^^^ into Eq. 6-48 and integrat¬ 
ing yields 


/rx<*>> = 8 


' 19 , 10 , 


45 


+ 


1244 2 
3465 



If the three term approximation 


x(^>=ai(l -r^) + fl3(i 


is determined in the same manner, 
x<^>(0) = 0.93207. 

This might lead one to believe that both 
x^^^(t) and x^^^(t) are good approximations 
of the solution. 

6-2.4.2 METHOD OF FINITE DIFFER¬ 
ENCES 

The Method of Finite Differences, as its 
name implies, is simply based on the replace- 
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ment of derivatives and the integral by finite 
approximations of these continuous opera¬ 
tions. A grid, = to, fi, .... is 

placed on the interval r® < r < r' and the 
value of x(t) only at the grid points is sought, 
i.e., only the parameters x. = i = 1,...» n, 
and perhaps x(to) or are sought. 


chapters that these side conditions generally 
include both equality and inequality con¬ 
straints. An extension to inequality con¬ 
straints will be given in par. 6-4. 

The problem to be treated here is given in 
Definition 6-2. 


Replacing derivatives by finite differences 
and the integral by a finite sum, the problem 
is to determine the x. which minimize 



The problem is now simply an uncon¬ 
strained minimization problem in a finite 
dimensional space and may be solved by the 
methods of Chapter 2. 


Definition 6-2 (Problem of Bolza): The 
problem of Bolza is a problem of fmding uf 
b. x(t), r® < t < t^, which minimizes 


f r® (6-50) 

fo lt,x(t),u(t),b]dt 

subject to the conditions 


dt 


^f(t,x,u,b), t^ < t< t^ 


(6-51) 


6-3 A PROBLEM OF BOLZA 

6-3.1 STATEMENT OF THE PROBLEM 

Many real-world optimal design problems 
cannot realistically be reduced to the finite 
dimensional form of Chapter 5. In many 
problems the system varies continuously in 
time or space, so functions rather than just 
parameters must be determined. Examples 6-1 
and 6-2, par. 6-1, are extremely simple, yet 
even they involve distribution of the con¬ 
trolling factor over space and time. 

As has been seen in previous chapters, 
optimal design problems involve ideas of 
design variables and state variables. Further, 
since the system being designed must be 
capable of performing certain functions, side 
conditions on the state and design variables 
occur. It has been observed in previous 
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f f” 

^(b,t^,x0+ I 


LAt,x(t),u(t)Mdt 


= 0, Of = 1. r (6-52) 


^^(t,x,u,b) = 0, 


13=1.(6-53) 


where 


x(t) = 


Xi(t) 


xjt) 


u(t) = 


Ui(t) 




b = 


(6-54) 
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f(t,x, u, b) = 


f/t.x.u^b) 
f„(t,x, u,b) 


and f® < < f’*, where (t^,x0 are inter¬ 

mediate points, / = 1,...»t? - 1. 


For the problem considered here it will be 
required that the conditions, Eqs. 6-53, shall 
not determine any component of x(t) ex¬ 
plicitly. This is equivalent to requiring that 
the rank of the matrix 


(t.x.Kb) (6-55) 

V X w 

shall be q for all admissible values of the 
arguments. In case some constraint function 
should depend only on x(t) and t; this 
constraint is called a state variable constraint. 
This kind of constraint will be discussed in a 
later paragraph. 

The vector variable x(t) is called the state 
variable, tdt) ]s called the design (or control) 
variable, and b is called the design (or control) 
parameter. Eqs. 6-52 contain the boundary 
conditionsf on the state variable and functions 
which determine the end points of the inter¬ 
val, r® and f’*. The independent variable t 
may be time or a space-type variable, depend¬ 
ing on the problem being considered. 

The functions/o. /, , and are assumed 

to be continuously differentiable at all points 
except (t^,xO, / = 1, t? - 1. At these points 
the functions may have jump discontinuities; 
i.e., the functions will have limits along any 
path, but limits along different paths may 
have different values. In general, even for 
problems with very regular functions, u(t) 
may have jump discontinuities. Therefore Ult) 



is expected to be only piecewise continuous. 
The resulting state x(t), therefore, will have 
only a piecewise continuous derivative in 
general. 

The aUowed discontinuities of /©, /, 

important role in many 
real-world problems. This feature allows for 
completely different forms of state equations, 
constraints, etc., for different ranges of state 
and time. It is, therefore, possible to routinely 
account for sudden changes in system be¬ 
havior such as reverse in direction of frictional 
force, motion of objects in a space where 
physical barriers or restraint surfaces exist, 
logic built into the system which changes 
configuration as in staging of rockets, etc. It 
should be clear that these features are re¬ 
quired in order to treat many realistic prob¬ 
lems. 

For a discussion of the effect of these 
discontinuities on more detailed necessary 
conditions and sufficient conditions, see Ref. 
8 . 

6-3.2 A MULTIPLIER RULE 

As mentioned in par. 6-3.1, real-world 
optimal design problems require at least the 
complexity of the Bolza problem of Def. 6-2. 
In fact, the system designer requires all the 
tools the mathematical theory of optimal 
processes can give him. This requirement 
points out one of the obstacles to engineers in 
utilizing the modern theories of mathematics. 
This text cannot possibly present the mathe¬ 
matical theory required of the research math¬ 
ematician who is developing the theory of 
optimization. The approach taken here to 
by-pass this obstacle is to accept a key 
theorem of Functional Analysis and then 
proceed to develop the tools required for 
solving problems of optimal design. A very 

6-17 


Digitized by L^ooQle 



AMCP 706-192 


powerful theorem of Liustemik and Sobolev, 
Ref. 6 , page 209, will be used to obtain 
necessary conditions for the problem of 
Bolza. 

Theorem 6-4: If u(t), b, and x(t) pro¬ 
vide a solution to the Bolza problem of Def. 
6 - 2 , then there exist multipliers Xo > 0 , 7 ^, a 
= 1, .... r, \/t), 1=1,.... /I. and jS = 1,.... 

q, not all zero, such that 


67 = 0, (6-56) 

where 

Ot * 1 


+ 



^ofo(f.x.u.b) 


(6-57) 


+ S \/t) 
<•1 


' dXf 

—-f,(t.x.u.b) 


+ y^Ljt.x.u.b) 


fl-l ® 


.jc.w.W 


dt. 


Note that the symbol 67 is the first 
variation of 7 as defined in par. 6-2. For 
proofs of this multiplier rule, the reader is 
referred to the literature (Refs. 2,5-9). 


This theorem says nothing about the con¬ 
tinuity and differentiability properties of the 
solution x(t), iHt), and the multipliers \J[t) 
and t). In general, piecewise continuity is 
all that may be expected of u(t). Eq. 6-51 
then implies x(t) has a piecewise continuous 
derivative. The properties of \/t) and 
will be determined when necessary conditions 
are derived. 
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6-33 NECESSARY CONDITIONS FOR 
THE BOLZA PROBLEM 

The simplest Bolza problem is the one 
having all its functions three times continu¬ 
ously differentiable. Even in this case, how¬ 
ever, 1) may be only piecewise continuous. 
To include this possibility, let t* be a point of 
discontinuity of any component of Ult). 

Before computing the variation called for 
in Eq. 6-56, it should be noted that r®, t*, 

and are not fixed but must be determined. 
This means that these special points must be 
treated as parameters that are to be deter¬ 
mined, much as the design parameter b. At 
first glance, this may seem to introduce no 
essential complication into the problem. The 
behavior of the allowed variations in xff), 
however, must be treated very carefully. 

Let f be a typical point ot t* where xf t) 
may very weU be discontinuous. The function 
x(t) will be changed at i by both the 
independent variation in x(t)y bx(t)^ and the 
shift in the point ff. bt). Denote the total 
change ’mx(i) due to both of these sources by 
Axf i). It must be assumed that there are no 
other points or t* arbitrarily near f, so 
limits from the left and right exist. For 
x( t) is continuous so the total change Axf t) in 
x(i) due to bx(t) and bt is continuous and 

Ax(i) = Axf f - 0> = Ax(t + OA 
where 


Axf t - 0) - bx(t - 0) x(i - 0) bi 


and 

Axff+ 0> = 6 xff+ 0) + xff+ 0)6f. 

It should be noted that this condition 
imposes restrictions on bx(t — 0 ) and 6 x(f + 
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0). In particular, they are not necessarily the 
same so 6x(f) on f® < f < t’* need not 
necessarily be continuous. 

Before enforcing Eq. 6-56, put J in the 
form 


a *1 

^ \ I * /-I 

[dx, ] r 

*V-^-fiit.x.u.b) \L^it.x.u.b) 

+ ^^ ii^U)^fU.x.u.b) I dt 


(6-58) 

where the argument of the pair of braces is 
the same as that of the integrand of the first 
integral. Note that t* and are simply typical 
elements of their respective classes. 

For convenience in the development that 
foUows, define 


a« 1 


H(t^,u,b.\’'f,li) - \^(t)f(t,x,u,b) 

- Kfoit.x.u.b) 

- j^y^LJi.x.u,b) 


fi’i 

so that Eq. 6-58 becomes 

_ I 

J = G + 


- 2 ti^U)^^(t.x,u.b) 


(6-59) 


dt (6-60) 


f Kl-"]- 

I-.]* 

Eq. 6-56 may now be applied to yield 


^ ac „ ac ac „ 

0 =— Ax® +... +— Ax® +-^ 6/® + ... 

ax® ax*i at® 


+-— 6 /® +— 66 
bb 


8 « , 9ff ,1 ^ 

1 ' r .r, , 

W , 3H .1 ^ 
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r'-O 

tf + Q 


dt‘ 


-[ 


X^(r®+0) 


dx{t + 0 ) 
dt 


- H(t'> + 


Q,x.u.b) «/® 




n r^-0 

J f^ + 0 


[ 


+ - 0 ) 


dx(t^ - 0) 

dt 


-Q,x. u. St^. 

Integrating the first terms in each integrand 
by parts yields 


ax® a;^ ar® 


ac ac7 

+ ...+— 6 r® + — 

dfi db 


tl 


- 6 x +— 

dt ax 


-u- 

+ — 5ii +-“7- bb dt 

bu bb J 

- j ^ l 1 dr - j ' ( 1 dr 

+ [X’" (t^ - 0) - X ^(tf + 0)1 Ax^ 

- [Hit^ - 0) - mtl + 0)1 6ti 

+ [X^ (f * - 0) - X^ (t* + 0)1 Ax(r*) 

- [W(r* + 0) -//(r* + 0)lfir* 


+ X^(f" - OlAx** - Hit'> - 0)St'> 

- X^ (r® + 0) Ax® + Hit*> + 0)6f®. 

Since the variables Ax', 6r*, 6b, 6x(r), ana 
6ii(r) are arbitrary, (Ax' are taken as arbitrary 
along with 6t^ so that 6x* = Ax^ - x^6t^ is 
fixed) Lenuna 6-1 applies. Application of this 
Lemma yields 

Theorem 6-5: If (x(f), ii(r), b, x^) is a 
solution of the Bolza problem of Def. 6-2, 
then there exist multipliers X© > 0, X^.(/), i = 


1 . n, y^, o= 1 . r, M/r),|S= 1 ,.. 

all zero, satisfying the conditions: 

q. not 

dX ^H"^ . ^ 1 

(6-61) 

-^= 0 , for t # tf 
du 

(6-62) 

f f® 

1 aw. „ 

(6-63) 


^o-Mr®) = 0 


bG^ 

bx^ 


+ X(r^) = 0 


bQf 

—r +X(f^- 0 )-X(r/ + 0) = 0 
ox' 


(6*64) 


i£ 

ar® 


+«(r® +0) = 0 




tiC* 

H{tl - 0) + Hit! + 0) = 0 


H{t* - 0 ) - Hit* + 0 ) = 0 


(6-65) 


( 6 - 66 ) 
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X(r^ - 0) - X(f^ + 0) = 0 (6-67) 

Note that the necessary conditions, Eqs. 
6-61 through 6-67, are linear and homoge¬ 
neous in the multipliersX©,X^(r),7^^, It 
is, therefore, permissible to choose the magni¬ 
tude of one multiplier so that the remaining 
multipliers will be uniquely determined. It 
seems reasonable that if the necessary condi¬ 
tions obtained by setting 67 = 0 are to be 
related to minimization of 7, then Xq should 
not be zero. This is indeed the case and if Xq 
is required to be zero by the necessary 
conditions, then the Bolza problem is “abnor¬ 
mal” in a sense. Most meaningful problems 
are normal as defined in Refs. 7, 8 , and 9 and 
require X© # 0 . In solving problems using the 
necessary conditions of Theorem 6-5, one 
should first verify that Eqs. 6-61 through 6-67 
have no solution if X© = 0. It is then 
permissible to put X© = 1 so that the 
remaining multipliers are uniquely deter¬ 
mined. 


equations to determine the unknowns. It does 
not assert that a solution of Eqs. 6-61 through 
6-67 exists. Existence theory for these prob¬ 
lems is a difficult question that is treated in 
Refs. 10 and 11. 

The conditions of Theorem 6-5 are very 
nearly the famous Pontryagin Maximum Prin¬ 
ciple (Ref. 12). The condition that completes 
the Maximum Principle is an inequality which 
follows from the Weierstrass condition of the 
calculus of variations. This condition is given 
as Theorem 6 - 6 . 

Theorem 6-6: In addition to the condi¬ 
tions of Theorem 6-5, the solution of the 
Bolza problem must satisfy the condition: 

< //[r,x(r),u(O,6,X(O,7,01 ( 6 - 68 ) 

for all admissible U and all t, < t < . 


Even though Eqs. 6-61 through 6-67 are 
very complicated, it is interesting to note that 
they provide just the right number of equa¬ 
tions to solve for all the unknowns. Eqs. 6-51 
along with Eq. 6-61 form a system of 2n 
first-order differential equations for x(t) and 
X(f). Further, the first and last members of 
Eq. 6-64 may be considered as 2n equations 
in boundary conditions on X and x. This is the 
proper number of boundary conditions. The 
second equation of Eqs. 6-64 provides any 
jump conditions in X(r) at the intermediate 
points 0 < / < T?. Fqs. 6-65 may be 
interpreted as determining / = 0 , 1 , ..., t?, 
and Eq. 6-66 determines t*. Eq. 6-67 simply 
states that X is continuous even at jump 
discontinuities in u. Finally, Eq. 6-63 deter¬ 
mines the design parameter b. 


For proof of this theorem see Refs. 8 and 
13. 


Another useful result is the following iden¬ 
tity: 


dH 

dt 


—, for t ^ r. 
bt 


(6-69) 


This condition is useful in case H does not 
depend explicitly on t. Then H is constant 
between the points and at these points it 
may have discontinuities governed by the 
third equation in Eq. 6-65. 

To prove this relation holds, compute 
formally 


It should be clear that this argument only 
shows that there are the proper number of 


dt bt bu dt bx dt b\ dt 
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dx ^ dH „ dH 
Since— = = 0, — 

at ou ox 


dt ’ 


UlM. ^ 

and-— =/^, this is 

OA 

dt bt ~ dt ‘dt~~^ 


as required. 


6-3.4 APPLICATION OF THE BOLZA 
PROBLEM 

In order to obtain familiarity with the 
Bolza problem, several examples will be con¬ 
sidered. In order to illustrate the basic ideas 
associated with the Bolza problems, these 
examples will be elementary. In real-world 
problems the engineer should be prepared for 
complexity that will probably force him to 
use a numerical method of solution. For 
examples of the Bolza problem in the field of 
aerodynamics, a field which contributed 
greatly to optimal design theory, see Refs. 
15,16, and 17. 

Example 6-6.- Maximum Range Rocket- 
assisted Projectile 

A projectile of mass m is acted on by a 
fixed force F as shown in Fig. 6-8. The angle 
of d(t) is measured from the x-axis, where the 



Figure 6-8, Particle in Motion 


X- and y-axes are fixed in inertial space. In the 
problem discussed here, 6{t) is to be chosen 
so as to direct the motion of the particle. 
Hence 0(f) is the control variable. 

Denoting horizontal and vertical compo¬ 
nents of velocity of the projectile by u and v, 
respectively, the motion of the projectile is 
governed by the equations 


x = u 


y = V 


. F ^ 
u =— cos B 
m 

V =— sm 0 -g, 
m 


(6-70) 


where 


' dt 


The projectile is fired from a gun at time t 
= 0 with x(0) = y(0) = 0 and initial velocity 
ii(0) = V cos 00, KO) = V sin 0o, where V is 
the muzzle velocity of the projectile. The 
problem at hand is to choose 0o and 0(/) so 
that at some future time T, the projectile will 
hit the earth as far as possible from the launch 
point, i.e., y{T) = 0, x(T) = maximum. 


In the notation of the Bolza problem, 0(f) 
is a design or control variable M(f), 0o is a 
design parameter b, T is terminal time t ^, and 
(x,y,ii,v) is the state. The quantity to be 
minimized is 


J = go[b.t^.x/) = -x(D. 


Boundary conditions on the state variables are 


6-22 


Digitized by L^ooQle 



AMCP 706-192 


gi =a:(0) = 0 

82 =>'( 0 ) = 0 

83 = u(0) - K cos = 0 

^4 = *’(0) - sin 00 = 0 


8s =y{T) = 0. 


(6-71) 


■Ji =K(.0) 


y2=\(.0) 


:=73=x„(0) 


(6-77) 


As deflned in Eq. 6-59, 

G = - XoX(71 + 7 , j:(0)+ 723^(0) 

+ 73 lu(0)-^"cos0ol (6-72) 

+ 74 l»’(0) - Ksindol +7s3'(70 


dG 

3K0) 


, = -Xo = -X,(n 


// = M + V + — cos 0 

* y “ m 




(6-73) 


where variable named subscripts are used for 
the X’s. 

Theorem 6-5 yields as necessary conditions 


dG 


■ = 0 = \(D 


■^0 = \(D 


0 = H(T-0) 


(6-78) 


(6-79) 


-^= X 
du * 


(6-74) 


Xjj cos 0 + X^ sin © < Xj^ cos 0 


+ X„ sin 0 


(6-80) 


X =-^=-X 


3// ^ 

— = 0 = --sin 0 +-cos 0 (6-75) 

uu m tfi 


3c f ^ aw , „ „ 

Jo^« " 


for all admissible 0. 


Eqs. 6-74 yield 




\ “ €2 




- 74 F' cos ^0 


(6-76) 
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( 


The last two equations of Eq. 6-78 imply (3 = 
(iT and ^4 = ( 2 ^- If we assume the problem 
is normal, X© = 1 so X^(r) = 1 = f i, so 


Nc = ' 
\=(2 


(6-81) 


/ 


Substituting from Eqs. 6-77 and 6-81 into 
Eqs. 6-75 and 6-76 

r sin 00 - ^2 7’ cos 0 o = 0 

and 


t^F 

xit) -tV — cos 00 
2 m 


y{t)-tVsmeQ -~f^g 


t^F , ^ 


> (fr84) 


By use of these equations, the last equation in 
Eq. 6-71 yields 


1 TF 

T{V sin 00 —-gT +■— sin 0o) = 0. 

2 2 m 


This implies 



(6-85) 


- (T - f) sin 0 + ^2 (^ - f) cos 0 = 0 . 


For all r # T, this is 
— sin 0 + f 2 cos 0 = 0 . 


The one condition which has not been 
used is Eq. 6-79. By substitution of Eqs. 6-81 
and 6-83 into Eq. 6-73, Eq. 6-79 becomes 



cos 00+^2 1 sin 00 




These equations imply 
^2 = tan 00 


(6-82) 


I 

Integrating the last two equations in Eq. 


6-70, 

u(t) = K cos 00 


—cos 00 
m 


tF 

v(t) = F sin 00 +— sin 00 - .g/ 
m 


(6-83) 


Integrating again, 
6-24 


TF 


sin 0 ( 


m 


-gT^= 0 . 


By use of Eq. 6-82 this becomes 




-gr sin 00 = 0 , 


or 


TF 

V H-- gT sin 00 . 

m 


Combining Eqs. 6-85 and 6 - 86 , 


HI (-S) 4'- 


( 6 - 86 ) 


Digitized by L^ooQle 



AMCP 706-192 


or 


(■ 


2 m^ 



3 FV 

+-r+ =0 

2 m 


so 



Substituting T from Eq. 6-87 into Eq. 6-85 
then gives an easy equation for 

While the results of this problem are not 
particularly useful, the solution does illustrate 
the use of the various conditions in Theorem 
6-5 in generating a candidate solution of the 
problem. The reader, however, should not be 
led to believe that all Bolza problems may be 
solved in closed form as in this example. 

In more general problems the adjoint equa¬ 
tions, Eq. 6-61, cannot be solved so easily in 
closed form. Further, the equation dH/bu = 0 
may not yield so simple a condition as Eq. 
6-75 for the design variable. It is often of 
value to keep a procedure in mind for 
determining the various unknowns as in this 
problem, even though more realistic problems 
may require numerical methods at each step 
in the procedure. 

Example 6-7: Minimum Fuel Orbit Trans- 
fer 

A rocket equipped with a constant thrust 
engine is to transfer from a circular earth 
orbit of radius r© to one of radius F > r© 
using a minimum of fuel. The time allowed 
for this transfer is T. Further, it is possible to 
shut the rocket down during one time interval 
of the transfer if desired. The orbits are 


illustrated in Fig. 6-9 and the time scale is 
shown in Fig. 6-10. 


The times f i and t 2 may actually coincide, 
depending on the problem parameters. These 
times play the role of in the Bolza problem. 
The equations'of motion of the spacecraft are 
taken as 


, M h{t)Fsmd 

u =— --T+- 

r r^ m 


, MV h it)F cos 6 

V =— +- 

r m 


m = hit) q 


where 


> ( 6 - 88 ) 


/ 


hit)^ { 


\ 


1, 0< r < 

0 , /i < r < r 2 
\^t2 < t < T 



\ spacecraft 
\ 

\ 

\ 

\ 


Figure 6-9. Orbit Transfer 


Thrust On Thrust Off Thrust On 



Figure 6-10. Thrust Program 
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( 


and the boundary conditions are 
r(0) =ro. «(0) = 0 
>'(0) = /n(0) = mo, 

riT) = R, u(70 = O 


(6-89) 


where 
r = radius 
u = radial velocity 
V = tangential velocity 
m = mass of spacecraft 
M = gravitation constant 
F = thrust 

q = mass flow rate during thrust 

0 = thrust orientation 

Since - m(r) is the amount of fuel 
consumed up to time f, the object here is to 
minimize 

/ = mo - m(T). 


For use in Theorem 6-5, define 
G = mo - m(D + [r(0) - To 1 +72 w(0) 

+ T3[v(0)-(/i/ro)‘''] 

+ 74 1^(0)-mol +7s [r(r)-^l 

+ 76«(n + 77[v(70-(M//?)''’j 

6-26 


and 


^ Lr m J 

+ Xy uv ^ hCQFsin^ j 


The necessary conditions of Theorem 6-5 


are 


. , r ^(t)Fcos 0 '| 


> (6-90) 


h(t)F cos 


] 


X„(D = -l 
X(r, - 0) = X(f, + 0) 

X(/, -0) = X(r, +0) 

H(ti -0) = /f(f, +0) 

Hitj -0)=H(ti +0) 
the are those shown in Fig. 6-10. 


(6-91) 


(6-92) 


(6-93) 


(6-94) 
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The prospect of solving this set of equa¬ 
tions in closed form is dim indeed. A general 
procedure can be discussed, however, and the 
actual solution can be obtained using numeri¬ 
cal methods discussed in a later paragraph. 


From Eq. 6-91, 0(0 may be determined as 


0(0 = Arctan 



(6-95) 


for t not in 0 < r < ^2 > where 0 need not be 
defmed. The result of Eq. 6-95 may be 
substituted into Eqs. 6-88 and 6-90 so that 
these equations become a set of eight first- 
order differential equations for the state and 
adjoint variables. Eqs. 6-89 and 6-92 form a 
set of eight boundary conditions for these 
variables. Eqs. 6-93 show that the adjoint 
variable is continuous and Eqs. 6-94 deter¬ 
mine ft and A numerical procedure may 
be used to solve this problem. The resulting 
adjoint variables may then be substituted into 
Eq. 6-95 to obtain the explicit design (or 
control) variable. A problem of this kind is 
discussed in Ref. 18. The method used there 
to construct a solution is completely different 
from the one proposed here. 


64 PROBLEMS OF OPTIMAL DESIGN 
AND CONTROL 

The Bolza problem of par. 6-3 is of almost 
the generality required for optimal design. 
The principal shortcoming of that problem is 
in the lack of generality in the constraints. It 
has been noted in preceding chapters that 
meaningful optimal design problems generally 
involve inequality constraints. It is the pur¬ 
pose of this paragraph to extend the Bolza 
problem to account for inequality constraints. 

The problem treated here is given in Defini¬ 
tion 6-3. 


Definition 6-3 (Problem of Optimal De¬ 
sign): The optimal design problem is a prob¬ 
lem of finding ii(f), b, x(f), t^ < t < t^, which 
minimize 

f"" 

■t I /, (6-96) 

subject to the conditions 

^ u.b), r® (6-97) 

at 


g^ib,tl,x>)-^ 


= 0 ,a=l. 


g„(b.t^,xf) + 



< 0, a = r'+ 1, ...,r 
(p^{t,x,u,b) = 0,&= 1, 

f® <; r« r” 


and 


[t,x(t),uit),b]dt 


(6-98) 

[t,xit)Mt),b]dt 

(6-99) 


( 6 - 100 ) 


0^(r,x,M,b) < 0, = + 1 


t^ < t< t^. 




( 6 - 101 ) 


The variables and functions appearing here are 
identical to those in Def. 6-2. 


The inequalities in this problem are treated 
here in the manner presented in Ref. 13. The 
inequality constraints are first reduced to 
equality constraints, and the results of par. 
6-3 are applied. In order to perform this 
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reduction, define the slack variables a = r' 
+ 1 ,.... r and w^(r), = < 7 ' + 1 ,.... q by 

fr» 

gaib,t^,x^)+ I L^[t,x(t)Mt),b\dt 

( 6 - 102 ) 

+ vl = 0 ,a = / + l,...,r 


and 


0 ^(f,x.M.b) +(0 = 0 , 

j3 = ^'+ 1 , ,..,q 


(6-103) 


The constraints, Eqs. 6-102 and 6-103, are 
equivalent to Eqs. 6-99 and 6-101, respec¬ 
tively, where and w ^(0 are interpreted as 
design parameters and design variables. With 
these equality constraints replacing the in¬ 
equality constraints, the optimal design prob¬ 
lem becomes a Bolza problem. The necessary 
conditions of par. 6-3, therefore, may be 
applied to this modified problem. 

The form of the constraints, Eq. 6-101, has 
a great deal to do with the behavior of the 
problem. If some function 0^ depends only 
on r, X, and b then the problem is compli¬ 
cated in intervals in which 0 ^ = 0. This kind 
of constraint will be referred to as a state 
variable inequality constraint and will be 
treated separately. If 0 ^ does depend explicit¬ 
ly on u, then the constraint is referred to as a 
design variable inequality constraint. This 
problem will now be investigated. 


6-4.1 DESIGN VARIABLE INEQUALITY 
CONSTRAINTS 

In order to apply Theorem 6-5 to the 
problem of Eqs. 6-96, 6-97, 6-98, 6-100, 
6-102, and 6-103, the independence of condi¬ 
tions expressed by Eqs. 6-100 and 6-103 must 
be verified; i.e., the matrix, Eq. 6-55, is 

6-28 


required to have rank q. For this purpose, the 
design vector must be considered as (u^, 
w^)^, where w^). 


The matrix, Eq. 6-55, becomes 


301 

30, 

3m, ’ 

nt 




dM, ’■ 

,7-=- ,0, 
ou^ 

m 



bui 

’ dM 

m 

»0, 

30^ 

dM, ’ ■■ 

bu ’ 
m 


...,0 


...,0 




0 . 


2 w. 


(6-104) 


...,0 


This matrix is required to have rank q. In 
order for this to be possible the number of 
columns, m q - q\ must be greater than or 
equal to the number of rows, q, ox m - q* > 
0. Further, it is obvious that the first q* rows 
must be linearly independent, or the entire 
matrix could not possibly have rank q. Next 
note that if # 0 , then the ofth row must be 
linearly independent of all the other rows 
since it has the only nonzero element in the m 
+ a — < 7 'th column. Therefore, linear inde¬ 
pendence of the rows from q' + 1 to <7 need 
only be considered for those a with w = 0 . 

a 

By Eq. 6-103, this is the same as 0 ^ = 0. 


The conclusion is, then, that the matrix, 
Eq. 6-104, will have rank q if and only if the 
matrix 


30/“ 

—J, 0 , = o (6-105) 
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is of full row rank. This simply says the 
gradients of all constraint functions (which 
are equalities) with respect to the design 
variable must be linearly independent. Assum¬ 
ing this is the case, Theorem 6-5 may be 
applied. 


and the conditions, Eqs. 6-110 through 6-113, 
are unchanged. Further, Theorem 6-6 yields 

< ^[/.x(O,M(r),b,X(r),0,v.w] (6-115) 


Defme 


r 

G = Xo«o + 2 7 „g„ 

Ot* 1 

(6-106) 

r 

G'= 2 7„v2 

ot=r'+l “ " 

(6-107) 

r 

4 


// = XV-Xo/o- 


(6-108) 


r 



2 


(6-109) 


The quantities H- H H* and C = (/ + C?' 
take the place of H and G in Theorem 6-5. 
Necessary conditions for the optimal design 
problem are, therefore. 


d\ _ dH^ 
dt dx 

( 6 - 110 ) 


( 6 - 111 ) 

o 

II 

( 6 - 112 ) 

f'” 

^G 1 ^ ^ 

( 6 - 113 ) 

dC' 1 „ 

17 -] ir*"’" 

J fO 

( 6 - 114 ) 


for all admissible U, 

The condition, Eq. 6-112, in scalar form is 
-2M^vi;^=0,i3 = (7'+l,...,^?. (6-116) 

If = 0, then by Eq. 6-103 0^ = 0. If 
# 0,0^ < 0 and = 0. Therefore, Eq. 6-116 
is equivalent to 

= 0, p = + 1,..., q . 

(6-117) 


Condition, Eq. 6-114, is 

f 

27 . v, + 


(" 

I 27„v„</f = 0, 


a = r'+ 1, ...,r 


or 


27 „v^( 1 -r®) = 0, a = r'+l,...,r. 
Since 1 + r’* - # 0, 

7 «v^=0, a = r'+l,...,r. (6-118) 

If =0, then by Eq. 6-102, the constraint, 
Eq. 6-99, is an equality. If 0, then the 
constraint, Eq. 6-99, is a strict inequality and 
7 ^ = 0. Therefore, Eq. 6-118 is equivalent to 
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T« I gcib,t'.x>) 

+ j i,u,x(o,«a),6i d/I =0, 

a = r'-H. r. (6-119) 

The conditions, Eqs. 6-116 and 6-118, 
imply H* and C' = 0 so that H= H and G 
= G. The necessary condition, Eq. 6-115, is, 
therefore, just 


< H [tMt)Mt)AHt\yfi] (6-120) 


for all admissible U, It is further shown (Refs. 
5,10,12) that Xo > 0, 7 ^ > 0, a = r' + 1, ..., 
r, and /i^(f) > 0, /3 = q' + 1, .... q, < 
t < 

The conditions obtained through applica¬ 
tion of Theorem 6-5 to the optimal design 
problem may now be stated as Theorem 6-7. 


Theorem 6-7: If [x(0, m(0» b, x^\ is a 
solution of the optimal design problem of 
Def. 6-3 and if the matrix, Eq. 6-105, has full 
row rank, then there exist multipliers Xq > 0 , 
\(r), / = 1, .... m. 7cr» « = 1..... ^ 7cr > 0, a = 
r’ + 1 ,....r. P = 1, q, > 0, ^ = q' + 
1, .... q. not all zero, and functions G and H 
of Eqs. 6-106 and 6-108 such that 


dt 

bu 

bb 


r ^ ! 

= ---— ,forr=^r 
bx 

(6-121) 

= 0, for t^t^ 

(6-122) 

rt^ 

- 

(6-123) 
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bG^ 

dx® 

bx^ 


-X(r®) = 0 


+ X(f’») = 0 


bG^ 

-0)-UtU0) = 0 


■h 0 ) = 0 


dG 

— -//(/-- 0 ) = 0 


dG 


— _ Hit! -0) + Hit! -h 0) = 0 


X(^•- 0 )-X(^♦■^ 0 ) = 0 


lifit)(l>pU,x.u,b) = 0, /? = 1,.... q 


To I gAb.tKxh 


i: 


+ I L^{tMt\u{t\b\ dt 


a = 1,.... r 


dt 


bt 


, for r t^ 


and 

//[r.x(r).C/.b.X(f).7,01 
<^[f.x(f).ii(O.^A(0,r01 
for all admissible U. 
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(6-125) 

(6-126) 
(6-127) 
(6-128) 

0 

(6-129) 

(6-130) 

(6-131) 
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It should be noted that, just as in Theorem 
6-5, the number of conditions here is just 
equal to the number of unknowns, so one 
might be led to believe that a solution may be 
found. Existence of a solution is, however, a 
very difficult question that is treated in Refs. 
10 and 11. 

64.2 STATE VARIABLE INEQUALITY 
CONSTRAINTS 

In many meaningful design problems con¬ 
straints may involve restrictions only on the 
state variable. This is the case when some 
of Eq. 6-101 depends only on jc, and b. To 
study this problem, just one such constraint 
needs to be considered, i.e., 

< 0, r® . (6-132) 

Let r < t < r < be an interval in 
which of Eq. 6-132 is an equality. It is 
dear that = 0, so the matrix, Eq. 

6-105, has a zero row in this interval and 
hence cannot be of full row rank. Theorem 
6-7 cannot be applied directly, so further 
analysis is required. 

In the interval < t < , 0^ = 0 so it is 

necessary that 


plicitly on M(f), then this equation is of the 
form required in the problem treated in par. 
6-4.1. If not, then differentiating through this 
equation with respect to t and using the chain 
rule of differentiation 



dt^ ^ dtdx ^ 




dx* 


/ + 


dx dt ' 


(6-133) 


where all the arguments are omitted. If the 
right side of Eq. 6-133 depends explicitly on 
u{t) then this equation is of the form treated 
in par. 6-4.1. 

This process continues until 

0=_^ (6-134) 


involves u{t) explicitly in its right side and 
u{t) can be determined as a function of x{t) 
and b, as in par. 6-4.1. The integer A 1 
is defined to be the first integer for which this 
is true. The constraint, Eq. 6-132, is then 
called a p^th order state variable inequality 
constraint. 



dx 

+ ——. 
dx dt 


From Eq. 6-97, dx/dt may be replaced by / 
and this relation becomes 

0 *—- - —^- 

dt dt 


d<^At.x{t)M 

+ - f{t.x{t),uU).b\ . 


From the theory of ordinary differential 
equations (Ref. 14), Eq. 6-134 throughout 
r < t < t* and 

<Pflt-.xit-),b]=0 (6-135) 


d Pfi 

—^ [r-,jc(rx/>l =0, / =! 

dt^ 




(6-136) 


If the right side of this equation depends ex- are equivalent to 0^ = 0 throughout t < 
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f < . This, of course, requires that 

have piecewise continuous derivatives, 
and f have piecewise continuous 

derivatives in r” < f The point 

t~ plays the role of a in the problem 
stated earlier in this paragraph. 

It will be assumed that when the right side 
of Eq. 6-134 is used in place of in 
computing the matrix, Eq. 6-105, this matrix 
has full row rank. In this case Theorem 6-7 
may be employed. To utilize this theorem, 
define 



I (6-137) 


where = 0 if 0^ involves u explicitly, 

C = Xo?o+ S Kg^ (6-138) 

Ot * 1 

Vi f I 

\ ^7e -(r ,x(0,6l[(6-139) 

/=o [ ‘>0 j 

where this sum on P is extended only over the 
indices associated with state variable in¬ 
equality constraints, are multipliers, and 


H = yJf-\ofo- 2 

Of * 1 

(6-140) 

With G = G + G and H replacing G and H in 
Theoiem 6-7, a set of necessary conditions for 
this problem are obtained. They are easily 
computed and are given here as Theorem 6-8. 
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Theorem 6-8: If [x(0, u{t), b, is a 
solution of the optimal design problem with 
state variable inequality constraints, then 
there exist multipliers X© > 0, X^(f), / = 1,.... 
Ta* ^ ^ Ta ^ 0, a = r' + 1, ..., r. 

13 = l,....q,M^(0> 0,l3=q' + l,...,q. and 
/ = 1, ..., and jS associated with a state 
variable constraint and 0^ = 0 in f < r < 

— ,forr“. (6-141) 

at ox 

hH 

—= 0, for (6-142) 

bu 


dG dG dH , 


bG^ 

ax® 

ac^ 

ax” 


- X(/®) = 0 

+ X(r”) = 0 


dG^ 

+ Mtf - 0 ) - \(tf + 0 ) = 0 
bx' 

bG^ 

+ Mr - 0) - X(r- + 0) = 0 

OX 

Mt* - 0) - Mt* + 0) = 0 


+^(r® + 0) = 0 




X6-144) 


ac 

ar^ 


ac 


- - 0)+ H(tf + 0) = 0 ye-145) 


— -Su- -0) + 6ir +0) = 0 
HU* - 0) + HU* +0) = 0 
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H(t* -a)-H(t* + G) = 0 

(6-146) 

X(r*-0)-X(r* + 0) = 0 

(6-147) 

= 0, (3=1,. 

/ 

... q (6-148) 




Ct^ ] 


+ LJtMt)Mt)Mdt } 

= 0, 

J f* ) 


a= 1 ,..., r 

(6-149) 

dH bH * 


dt dt 

(6-150) 

and 


Hlt.xUlU.bMt).y,0] 


< ^l/.Jc(r),«(0.6,X(a7,01 

(6-151) 


for all admissible U. 

The full set of necessary conditions em¬ 
bodied in this theorem is awesome from a 
computational point of view. The differential 
equations for x and X are subject to multi¬ 
point boundary conditions that involve a set 
of undetermined multipliers. In a gross sense, 
Eqs. 6-147 may be viewed as determining 
intermediate points in r® < r < r’’ and the 
associated boundary conditions on x(t) and 
X(r). 

Use of the theorem is further complicated 
by the fact that the design variable may be 
determined as the solution of £q. 6-142 
which satisHes Eq. 6-151. This means that u 
will be determined as a function of x, b and 
all the multipliers. The expression for u will 
generally take different forms in different 
subintervals of < t < and the spacing of 


these subintervals is not known before the 
solution is computed. The generality of the 
problem makes it difficult to discuss all its 
intracies without resorting to special cases and 
examples. 

6-4.3 APPLICATION OF THE THEORY OF 
OPTIMAL DESIGN 

In order to develop some familiarity with 
the methods of the preceding subparagraphs, 
several examples will be treated here. These 
problems will be idealizations of real-world 
problems but will illustrate the basic ideas 
which carry over into more complicated 
problems. 

Example 6-8: Time-optimal Steering of a 
Ground Vehicle (Ref 19) 

To illustrate the concepts presented in par. 
6-4.2, an optimal vehicle steering problem will 
be solved. This problem is chosen because of 
its clarity of formulation and solution. A 
ground vehicle (a tractor in this case) is to be 
steered so that it begins at a given point and is 
steered so that it reaches a given straight line 
path in the shortest possible time. The vehicle 
and the line it is to reach are shown in Fig. 
6 - 11 . 

Point A , midway between the rear wheels, 
is located by the coordinates Xi(t) and X 2 {t). 



Figure 6-11. Ground Vehicle 
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Orientation of the vehicle is specified by a 
third variable x^it). Steering of the vehicle is 
accomplished by choosing the angle 0(t). 
From physical grounds it is clear that the 
state of the vehicle is described by x(t) = 
[Xi(r)^ 2 ( 0 r^ 3 ( 01 ^ and the vehicle is con¬ 
trolled through choice of d(t). 


+ 74 +7s X 3 {T) 

H-\xV cos X 3 + X 2 K sin JC 3 + \^a tan B 

~ Ml “ ^ 0 ) “ M 2 (^0 “ ^)- 
The conditions of Theorem 6-7 are 


It is assumed that the rear axle of the 
vehicle moves with a constant velocity V. In 
this case, motion of the veliicle is governed by 
the differential e 

Xi - V COSX 3 

X 2 -V sinx3 

x^-a tan B 

where a = VjL. At the initial time / = 0, x®(0) 
= x\, X 2 ( 0 ) = xj, and X 3 ( 0 ) = x®. The 
terminal time T is not determined but it is 
required that X 2 (T) = x\ and X 3 (r) = 0 since 
the vehicle must be tangent to the target line 
at time T. 


jquation 


(6-152) 





— 


-Xa 


= Xi Ksinx3 
V cos X 3 


\ 


> (6-154) 


— = 0 = X3a sec^ 0 - Mi +M 2 (6-155) 

oB 

X,(70 = 0 (6-156) 

Xq “ X| {T) V cos X 3 (T) + X 2 (T) V sin x^iJT) 


The steering angle is limited by 


+ X 3 (Da tan 0(70 (6-157) 


-00 <^<^0 (6-153) 

and as an idealization it is assumed that any 
steering angle in - 0o < 0 < 0o may be 
chosen instantaneously. For a reasonable 
problem it is clear that 0o < ^r/2. Further, for 
definiteness, assume Ixjl < >r /2 and x\ > x®. 
All other initial conditions can be obtained 
from these by reflection in Fig. 6-11. 


M,(e- 0 o) = o 


f*2(9o -9) = 0 


(6-158) 


and 


dt dt 


(6-159) 


The first two equations in Eq. 6-154 yield 


The problem is now in the form described X, (f) = 

in par. 6-3. For use in Theorem 6-7, 

X 2 (f) = t 2 

C = Xo^-^7, [x,(0) -X®1 

and Eq. 6-156 implies = 0. The last 
+ 72 [■’^2 (0) -Xjl +73 [^ 3 ( 0 ) - xj] equation in Eq. 6-154 is then 
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X3 = - ^2 ^ COS JC3. 

Using the first equation in Eq. 6-152 to 
replace V cos JC 3 , this is 

X3=-t2^1. 

Therefore, 


X3(0 = -t 2 A:i(/) + t3. (6-160) 

The behavior of 0(/) may be isolated to 
two different cases. The first is |0(/)| = 

The second is |0(/)| < 0o» which case Eq. 
6-158 implies Mi(0 = MiCO = 0- Eq. 6-155 
then shows that X 3 (/) = 0. By Eq. 6-160 then 
Xi(r) is either a constant or {2 = fa = This 
and Eq. 6-157 then implies X© = 0 so all X^ are 
zero. This is forbidden by Theorem 6-7, so 
XiU) is a constant when | 0 (r) I < But if 
jci(r) is constant jC |(0 = 0 and the first 
equation in Eq. 6-152 implies jc 3 (/) = 0. The 
last equation in Eq. 6-152 implies 0(t) = 0. 

It is clear then that if | 6 (/)| < Oq for some 
interval of time, the path of the vehicle must 
be a straight line parallel to the X 2 'axis in Fig. 
6 - 11 . 

Since the last two terms in H are zero, the 
only explicit dependence of H on 6 is through 
the term X 3 (/)a tan d(t). The inequality, Eq. 
6-131, states that 6(0 must maximize//. It is 
clear then that if X 3 (/) ^ 0 , then 

0(0 = 00 sgn [X 3 (/)l, (6-161) 

where 


191 

sgnq =—. 
Q 


Further, it is clear that 6(0 = 0 is possible 
only when X 3 (r) = 0 . 


Since < ir/2, for t small, either 6(0 = 0o 
or 6(0 = - 00 * From Fig. 6-11, it is 
reasonably clear that 6(0 = 0o and Eq. 6-152 
can be integrated to obtain 


•*^ 1(0 = +/? [sin (jcJ +60 ^ 

- sin X 31 

X2(0 -xl - R [cos (jcJ + 60 » 

- cos 1 

■^ 3(0 = +flrtan 00 , , 


(6-162) 


where 


6 =fltan 60 
R = V/b. 


This path is just a circular arc with center at 
(Xj - /? sin xj, xj + /? cos xj) and 
counterclockwise motion. 


Similarly, if 6(0 should become - 0o at 
sometime t* where Xi(/'^) = x* ^ 2 (^^)“^ 2 » 
and X 3 (r^) = xj then the path is described by 


•^i(/) = ^r Jsin (xj-60 

- sinxjl 

^ 2 (^) = ^2 [cos(xJ -bO > 

- cosxjl 

•*^3(0=^? -flftan 0 o. 


(6-163) 


This path is a circular arc with center at (x* + 
R sin X*, x* - R cos xj) and clockwise 
motion. Since this circular arc must be tan¬ 
gent to the line X 2 = Xj, the X 2 -coordinate of 
the center must be x^ - R = x* - R cosxj. 
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Note that by Eq. 6-152, XaC/) must be 
continuous, so JC| and JC 2 are continuous. 
Therefore, the tangent to the path of point A 
in the (jCi.Xj^plane is 


dxi 


= tan X 3 


xj =“(x 5 ’^Rcosx\’^x*-R cosxf). 

Further, the relation noted just below Eq. 
6-163 is 

x\ - R-x* - R cos jcJ. 


and this slope is continuous. This means that 
segments of the optimal path where 0 = — 0 o> 
0 , or Oq must be tangent where they intersect. 
With this information, the solution of the 
problem may be constructed geometrically. 

In Fig. 6-12 the initial arc, which is 
described by Eq. 6-162, is shown leaving 
A whole family of second arcs is 
shown corresponding to different values of 



Figure &1Z Extremal Arcs 


From the construction of Fig. 6-12 it is 
clear that the point of tangency of the two 
circles (xf, xj) is at the middle of the line 
joining their centers, i.e., 

Xi=^(x^ -/? sinx^ + sinacj) 


These equations yield 

-X 2 -R 

.xrj =-j(x5 +xl+R cosxl -R). 

It may be noted by examining the family 
of paths in Fig. 6-12 that if Xj > 5 = /? + xj + 
R cos X 3 , then the first arc has been followed 
beyond a time i where X 3 (f) = ir/2. At the 
point Xi(f) = xf -/? sinxj+/?, X 2 (f) = x 5 + 
R cos xj it would have been possible to 
construct a vertical portion of the optimal 
path. This construction is shown in Fig. 6-13. 

The extremal paths constructed for xj > s 
satisfy all the conditions of the theorem so 



Section 
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that they may be optimum. It is clear that for 
jCj < 5 there is only one possible solution of 
the problem. For > 5 this is not the case as 
shown for = jCj. Both the extremals 
leading to the path X 2 = satisfy the 
necessary conditions of the theorem. It is, 
geometrically, relatively clear that these are 
the only two possibilities so the one with the 
shortest time required to get to X 2 = x^ is to 
be chosen. The test, Eq. 6-151, may eliminate 
one candidate. It seems clear that when the 
extremal with straight line exists, it is best. 

It should be noted that if x^ > s + 2/?, it is 
impossible to intersect X 2 = x\ with only two 
circular arcs so the extremal with a straight 
section is required. 

This problem illustrates many of the basic 
ideas and complexities involved in optimal 
design and optimal control theory. Some of 
the features are worth noting because they 
will arise later: 

1. Pieced extremals. The conditions of 
Theorem 6-7 give a set of curves or solutions 
that must be pieced together to form the 
optimal path in state space. In the vehicle 
steering problem, these curves or arcs are put 
together geometrically. In more complex 
problems, this will have to be done analyti¬ 
cally using the conditions of Theorem 6-7. 

2. Multiple solutions. As seen in the fore¬ 
going problem, more than one candidate 
solution may be constructed. Condition, Eq. 
6-131, must then be used to choose the best 
of these candidates. 

3. Singular arcs. It occasionally happens, as 
in the vehicle steering problem, that there will 
exist a set of values of the state variables and 
multipliers such that the function H does not 
depend explicitly on the design variable. In 


this case, Eq. 6-122 provides no information. 
It is then required that the inequality, Eq. 
6-131, must be used to determine the design 
variable. For a complete treatment of this 
subject, see Ref. 20. 

The problem treated in this paragraph is 
not as complicated as most optimal design 
problems occuring in the real-world. It does, 
however, illustrate some of the features and 
difficulty encountered in most realistic 
optimal design problems. This problem should 
convince the reader that the solution of 
optimal design problems is not simply a 
matter of plugging numbers into formulas. 
Even though analytical methods will be 
stressed in subsequent work, the effective 
solution of this class of problems requires a 
sound understanding of the theory of optimal 
design. 

Example 6-9: A Constrained Brachisto¬ 
chrone Problem 

The problem considered here is similar to 
Example 6-4 but with a constraint added. It is 
required to find the path through ( 0 , 0 ) which 
lies above the line tan ol in the 

(^ir^ 2 )’Plane and that carries a particle, 
without friction, to the vertical line Xi = x} 
in the shortest possible time. The problem is 
shown in Fig. 6-14. 

This problem will be treated as an optimal 
design problem. On the assumption that there 
are no discontunities in the velocity vector, 
conservation of energy yields 

1 2- 

—= mgX2 

or 

V = (2gX2) 
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Figure & 14, Bounded Brachistochrone 


$ = ( 2 gX 2 sin u - ( 2 gX 2 tan a cos m 

= 0 (6-167) 

which does contain u explicitly. The con¬ 
straint, Eq. 6-166, is, therefore, a first-order 
state variable inequality constraint. 

In order to employ Theorem 6 - 8 , define 
multipliers - 7 ^, r“, - such that 


The equations of motion of the particle are 
then 


X| =(2gX2) COSM 

X 2 -( 2 gX 2 ) smw 


(6-164) 


where u is the angle between the X|-axis and 
the tangent to the path on which the particle 
is to travel. This angle u specifies the curve, so 
it is the design variable. The location of the 
particle is specified by the point (Xi^Xj) so 
this is the state variable. The boundary 
conditions are 


^i( 0 ) = X 2 ( 0 ) = 0 

jc,(r) = jc}. 


(6-165) 


The object is then to find u(/), jci(/), and 
X 2 U) such that a particle starting at rest at 
(0,0) reaches x{T) = xj in minimum time T, 
The path is required to satisfy the constraint 


0 = X 2 - Xi tana - h < 0. (6-166) 


Since the constraint of Eq. 6-166 does not 
involve the design variable u explicitly, the 
problem contains a state variable inequality 
constraint. Computing 0 and substituting 
from Eq. 6-164 yields 

6-38 


(? = 7’ + 7,x,(0) + 7,x,(0) 
+ 73 Ui(r)-x}l 

G = T“ (Xj - x7 tan a - /i) 

- 1/2 

H-\i{2gX2) COSM 

+ X 2 ( 2 g^ 2 )*^^ sin u 

x(cosu - tan a sin u). 


\ 


} (6-168) 


Necessary conditions from Theorem 6-8 are 


Xj =0 


< /-I 

X3 --g(7gX2) 


[Xi cosu 


> (6-169) 


+ X 2 sinu -m(cosm I 

-tan a sin u)] / 

1 /2 

( 2 gX 2 ) [- Xi sin u + Xj cosu 

+ M(sin u + tan a cos u)] =0 (6-170) 
X 2 (n = 0 (6-171) 
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— r“ tan a + Xi (r — 0) 


-X|(r-+0) = 0 

T’ + XaCr -0) 

-Xj (r- + 0) = 0 
1 - H{T - 0) = 0 


> (6-172) 


/ 

(6-173) 


- /r(/- - 0) + H(r + 0) = 0, (6-174) 

and 

dk 

-—=0. (6-175) 

at 

Ideally, the solution for u{t) might proceed 
by solving Eq. 6-170 for u as a function of X 
and This result could then be substituted 
into Eqs. 6-164, 6-167, and 6-169. The 
variables X, jc, and m could then be determined 
gnd the results substituted back into the 
previously derived equation for m- This would 
be the desired solution. It is clear that these 
steps would be extremely messy so a huristic 
argument will be used here to suggest a 
solution. This solution can then be checked in 
the conditions Eqs. 6-169 through 6-175. 


It might be expected that when 0 0, then 

the curve is a cycloid as in Example 6-5. 
Whenever 0 = 0 it is clear that m = a. This is, 
in fact, the case and as presented in Ref. 21 
the solution is a cycloid for 

( t “') 

i.e., the optimum path does not touch the 
constraint surface. 


^ 2 r /ir \ 1 

“ 1 - ( —- a ) tan a 

x\ I \2 / J 


the optimum curve is given by 


K 


«U) = 


0< t< t~ 
a, t~ <. t< t* 


wa iT-t), t* < T 


where 


w, 


«a 


[ g(a - ir/2 + cot a) 1' 

2h cot a J 

_ rg(a + cot g) 1 *1^ 
|_2(xJ +h cot Of) J 


_ _ »/2 — a 

«, 

/♦ = 7’-a/(2w,) 
and 


r2 I 

* + /i cot a)(a + cot a)J 


Fig. 6-15 shows solutions for tan a = 1/2 
and several values of h. 


The reader may very well get the impres¬ 
sion from these examples that analytical 
solutions of general optimal design problems 
are extremely difficult to obtain. This is 
indeed the case. Therefore, either numerical 
methods must be used to solve the equations 
given as necessary conditions in the theorems, 
or some direct computational method must 
be used to solve the optimal design problem. 
Some numerical methods of solving the neces¬ 
sary conditions are presented in the next 
paragraph. Several optimal structural design 
problems are solved in Chapter 7 to illustrate 
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Figure & 15. Bounded Brachistochrone 
Solution 


boundary-value problem to a sequence of 
linear boundary-value problems whose solu¬ 
tions converge to the solution of the non¬ 
linear problem. 

6-5 1 INITIAL VALUE METHODS (OR 
SHOOTING TECHNIQUES) 

In order to develop the main ideas without 
getting bogged down in notation, consider the 
problem of finding y(t) = lydt),..., 
that satisfies 

(6-176) 

at 


these methods. Direct methods of solving 
optimal design problems are presented in later 
chapters. 

6-5 METHODS OF SATISFYING NECES¬ 
SARY CONDITIONS 

The previous three paragraphs of this chap¬ 
ter have been devoted to obtaining necessary 
conditions for optimization problems of vary¬ 
ing degrees of difficulty. It has been observed 
that these necessary conditions generally 
reduce to some sort of boundary-value prob¬ 
lem, usually nonlinear. The object of this 
paragraph is to explore ways in which the 
boundary-value problem may be solved. This 
topic has received considerable treatment in 
the recent literature, so it will be treated only 
briefly here. 

Two different methods will be discussed 
here and will be applied to optimal structural 
design problems in the next chapter. The first 
method is based on a reduction of the 
boundary-value problem to a sequence of 
initial-value problems whose solutions con¬ 
verge to the solution of the original problem. 
The second method reduces a nonlinear 

6-40 


and 


= y^i , for some i 
. for some/ 


(6-177) 


where the total number of conditions in Eq. 
6-177 is n. In order to further simplify 
notation, assume the components of y{t) have 
been numbered so that the first equation in 
Eq. 6-177 holds for i = 1,..., k< n. 


Since initial-value problems are so efficient¬ 
ly integrated forward in time, the missing 
conditions on y at may be estimated as 

= = (6-178) 

and Eq. 6-176 integrated from r® to r* using 
the full set of initial conditions from the first 
equation at Eq. 6-177 and Eq. 6-178. The 
value of ) obtained from this integration 
will probably not satisfy the second equation 
in Eq. 6-177, i.e.. 


= k + \ .n (6-179) 

where t = (f^+p .... {„)^ and the notation of 
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Eq. 6-179 is introduced to illustrate the 
dependence of the final values of y on 

It is clear that a solution of the problem 
can be obtained if ( can be found so that Eqs. 
6-179 are equalities. To simplify notation, 
define the column vectors 

yU^ i) = ;{)) those / in Eq. 6-177 

and 

y^ - [yj] for the same/. 

The conditions which are to determine ( are 
= (6-180) 

Any numerical method of solving algebraic 
equations may be used to solve Eq. 6-189. If a 
scheme like Newton's Method or a Gradient 
Method is to be used, it must be possible to 
compute 

(6-181) 

where is an estimate of the solution of Eq. 
6-180. These partial derivatives may be ob¬ 
tained or approximated in a number of ways. 

The first method of determining the deriva¬ 
tives in Eq. 6-181 is to observe that yit) = 
and further, that the dependence on f 
is very regular (Ref. 14) so that 3y(/;{®)/3t 
exists. Differentiating formally with respect 
to { in Eq. 6-176, 

(6-182) 

and 

(0)=I0,...,0,..., 1,0...01, 

3 ty ik) (/) 

i = k+(6-183) 


The initial value problems, Eqs. 6-182 and 
6-183, for / = k + 1, ..., n may be integrated 
from r® to r* to obtain the derivatives 
required in Eq. 6-181. 


Once these derivatives have been deter¬ 
mined, the new estimate in Newton’s 
Method is given by 





(6-184) 

The process is repeated with playing the 
role previously occupied by f®. 

This method of finding the partial deriva¬ 
tives is direct in nature but requires the 
solution of Eq. 6-182 n - k times. Further, 
both the differential equations, Eq. 6-176 and 
6-182, must be programmed. 

A second method of constructing the par¬ 
tial derivatives of Eq. 6-181 (or approxima¬ 
tions of them) is to use a difference quotient, 
i.e., Eq. 6-176 is solved for { and f + 6 where 
6 = (0, ..., e, ..., 0)^ where / indicates the ith 
position and e is small. Therefore 

Hi c 

(6-185) 

Once these approximate derivatives are 
determined, the algorithm, Eq. 6-184, may be 
used. 

This approximate method of constructing 
the partial derivatives requires that the differ¬ 
ential equation, Eq. 6-176, be solved n - k 
additional times. It, therefore, requires 
approximately the same amount of computa¬ 
tion as the previous scheme, but all the 
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computation is performed with the same set 
of differential equations. This method is 
illustrated in the problems of pars. 7-2 and 
7-3. 


A third scheme which makes use of differ¬ 
entiation formulas for definite integrals is 
developed in par. 7-4. 

6-5.2 A GENERALIZED NEWTON METH¬ 
OD 

A second method which is used to solve the 
necessary conditions for optimization prob¬ 
lems is a Generalized Newton Method of 
solving boundary-value problems. It has been 
pointed out in the foregoing that Bolza 
problems and optimal design problems may 
be reduced to nonlinear boundary-value prob¬ 
lems. The method employed here was devel¬ 
oped for just such problems (Refs. 22, 23). 

In order to introduce the Generalized 
Newton Method for boundary-value prob¬ 
lems, consider the system of first-order equa¬ 
tions 


dt 


=g(y.t) 


(6-186) 


where 


yit) = lyi(f), .... and 

giy.t) = g„(y.t)]^ ■ 

In addition to satisfying Eq. 6-186, y(t) is 
required to satisfy 

= y®, for some/ \ 


(6-187) 


y/f*) = y}, for some/, f 

where the total number of conditions in Eq. 
6-187 isn. 
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The Generalized Newton Method for 
solving Eqs. 6-186 and 6-187 is similar in 
philosophy to the Newton method of solving 
algebraic equations. An estimate of the solu¬ 
tion, is made and the right side of Eq. 

6-186 is expanded about y^^HO using Tay¬ 
lor’s formula to obtain 

dt by 


- ^ [ty°Ht)]y^^Ht) (6-188) 

by 

where y^ ^ ^(/) is required to satisfy 
y( 1)(^0) - yO ^ fQj. tiiQse / in Eq. 6-187 
yj^Ht^ )^ylf for those / in Eq. 6-187 

(6-189) 


The boundary-value problem for y^^^(/) is 
linear so that if it has a solution, that solution 
may be obtained by superposition techniques, 
or any other technique for solving linear 
boundary-value problems, for that matter 
(Ref. 24). 

The function y^'^(/) is taken as an im¬ 
proved estimate for the solution of Eqs. 6-186 
and 6-187. This estimate then replaces y^®^(/) 
in the preceding analysis. If k is the iteration 
number for this process, then y^^Ht) is 
determined by 


dy<*) 

dt 


+/[/./*-‘VOJ 


>(6-190) 


- lty^~ (/)]y^* ^^(oj 
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and the boundary conditions 


and 


yf, for those i in Eq. 6-187 
)^yj ^ for those/ in Eq. 6-187 . 

(6-191) 

The sequence of approximations to the 
solution is considered to have con¬ 

verged when the difference between succes¬ 
sive iterates is sufficiently small. Theorems 
given in Ref. 23 show that if the initial 
estimate of the solution y^^^(t) is sufficiently 
accurate, then under rather restrictive condi¬ 
tions, the sequence converges to the 

solution of Eqs. 6-186 and 6-187. Further, 
the convergence is quadratic in the sense that 
the error at the /c + 1st iteration is propor¬ 
tional to the error squared in the /cth itera¬ 
tion. This kind of convergence is extremely 
nice. 

Even though it is difficult or impossible to 
verify the hypotheses of the convergence 
theorems in Ref. 23, it has been observed in 
practice (Ref. 23) that good convergence is 
nevertheless obtained in many real-world 
problems. 

Since the discussion in this paragraph is on 
ways of solving optimization problems, the 
Generalized Newton Method will be applied 
more directly to this class of problems. For 
the present, consider only the following prob¬ 
lem: 


minimize J = 



f(t,XM)dt 


(6-192) 


subject to 

^^f{.t.x.u) (6-193) 


Jf/(f®) = for some i 
x^(j ^) = xj for some / 


(6-194) 


where the total number of boundary condi¬ 
tions in Eq. 6-194 may be less than, equal to, 
or greater than n, jc(r) = [jc,(0 ...» x^(t)]^, 
m ( 0 =[ m , (t), 


Defining 


«-V. ♦,?. \f,. 

the necessary conditions of Theorem 6-5 are 


d\ _ 

ML 


dt 

bx 

(6-195) 



(6-196) 


and 

X^(f®) = 0,r #/in Eq. 6-194 
X,(f‘) = 0,5#/inEq. 6-194. 


(6-197) 


The argument used in applying the General¬ 
ized Newton Method to the problem of 
determining x(r), M(r), and X(r) from Eqs. 
6-193, 6-194, and 6-195 through 6-197 as 
follows: 


1. Solve Eq. 6-187 for 

u = m(/.x,X) (6-198) 

and substitute this expression into Eqs. 6-193 
and 6-195. 


2. These differential equations then form 
2n first-order, nonlinear differential equations 
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in 2n variables. Further, there are exactly 2n 
boundary conditions in Eqs. 6-194 and 6-197. 
This nonlinear boundary-value problem is 
now solved by the Generalized Newton Meth¬ 
od. 

3. The solution x(r), X(/) is then sub¬ 
stituted into Eq. 6-198 to obtain the optimal 
design function. 

Since the Generalized Newton Method, as 
presented here, is only capable of solving 
two-point boundary-value problems, inequa¬ 
lity constraints may not be treated explicitly. 
Rather, the general optimal design problem 
with inequality constraints must be reduced 
to a problem with only equality constraints. 
For example, for problems with constraints of 
the form 

< 0, (6-199) 

where 0^ depends explicitly on u a trans¬ 
formation may be performed by introducing 
an auxiliary design variable (slack variable) 
a^(r) through the relation 

w) + af (r) = 0. (6-200) 


an auxiliary parameter is introduced 
through 





( 6 - 202 ) 


where 


His)^ 


0,5 < 0 

1, s > 0. 


In a sense, 6^ is a measure of violation of Eq. 
6-201. The procedure in solving an optimal 
design problem with a constraint of this kind 
is to solve a sequence of problems with Eq. 
6-202 replacing Eq. 6-201, and ap¬ 

proaching zero as k becomes infinite; i.e., a 
modified design problem is solved imposing 
Eq. 6-202 in place of Eq. 6-201 with > 0 
chosen. This solution is carried out through 
use of the Generalized Newton Method de¬ 
scribed. The problem is then solved again with 
0 < beginning the iteration with 

the solution of the preceding problem. The 
process is repeated with 0 < 
until changes in successive solutions are suf¬ 
ficiently small. 


It is clear that with the new variable, Eq. 
6-200 is equivalent to Eq. 6-199. The neces¬ 
sary conditions of Theorem 6-5 may now be 
applied and the Generalized Newton Method 
utilized just as in the {receding case. 

In case the optimal design problems with 
state variable inequality constraints, a differ¬ 
ent technique for elimination of inequalities 
has proved effective. For constraints of the 
form 

( 6 - 201 ) 


The Generalized Newton Method presented 
here has been discussed by many authors and 
generally has received favorable comments. 
For a more detailed discussion and examples, 
see Refs. 23, 25 through 28. An outstanding 
treatment of the Generalized Newton Method 
also appears in book form (Ref. 29). A very 
rigorous treatment of existence and con¬ 
vergence properties of the method is given 
which applies to the control problems dis¬ 
cussed. The reader should note that some 
writers follow Bellman in calling the method 
described here, “Quasilinearization”. 
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CHAPTER 7 

OPTIMAL STRUCTURAL DESIGN BY THE INDIRECT METHOD 


71 INTRODUCTION 

7-1.1 THE CLASS OF PROBLEMS CON¬ 
SIDERED 

Since the beginning of engineering disci¬ 
plines, the engineer has attempted to develop 
structures and machines that perform some 
specified task. In the case of structures, a 
frame or truss is required to support a given 
system of loads. Likewise, machines and 
machine elements are required to support 
loads while they perform some function. 

The objective of the examples treated here 
is to illustrate organized methods that the 
engineer may use to obtain a load-carrying 
system which is best in some sense that is 
associated with the particular application. In 
design of commercial goods, the dollar cost of 
an element is probably the index that is to be 
minimized (Ref. 1). In military and aerospace 
applications, while dollar cost is important, 
frequently weight cost is even more essential. 
In the example problems presented here, the 
criterion of minimum weight will be chosen. 

Until very recently, most design procedures 
depended on the engineer’s intuition and 
experience in proportioning a load-carrying 
system. An analysis of the proposed con¬ 
figuration was then made to determine wheth¬ 
er the system met all requirements placed on 
it. If not, or if the preliminary design was 
obviously excessively strong, the procedure 
was repeated until a satisfactory solution was 
obtained. 


As systems become more complex and 
more emphasis is placed on minimum cost, 
the designer is unable to make all the trade¬ 
off analyses mentally. A method of design 
synthesis, therefore, is necessary which is able 
to include all requirements on the system and 
the requirement of minimum cost in a unified 
design procedure. One such method for opti¬ 
mal structural design is illustrated in this 
chapter. 

7-1.2 HISTORICAL DEVELOPMENT 

Very early in the development of me¬ 
chanics of materials, methods of determining 
stress and displacement for given bodies under 
the action of given forces were emphasized. 
As these methods became better developed, 
the question arose as to how a structure might 
be proportioned to satisfy certain require¬ 
ments and be best in some sense. Problems of 
this kind were considered by Lagrange (Ref. 
2) in 1771 and by Gausen (Ref. 3) in 1851. 

Until very recent years, methods of the 
calculus of variations were not sufficient for 
treating realistic design problems. Probably 
for this reason, design problems were stated in 
terms of a few parameters that specified the 
structure. For example, uniform beams of 
undetermined depth are placed in a given 
configuration. The depths are then deter¬ 
mined so that the structure supports the given 
loads and is as light as possible. For a detailed 
bibliography of this development through 
1963, see Ref. 4. For a more current bibliog¬ 
raphy, see Ref. 5. 


7-1 


Digitized by 



AMCP 706-192 


Another important method of design devel¬ 
oped principally by Prager and Drucker (Refs. 
6,7,8) is limit analysis. In this method of 
design, the structure is allowed to reach a 
state of collapse due to plastic action of the 
material. The resulting design is, therefore, 
safe for application of the given loads even 
though permanent deformation of the struc¬ 
ture results. If the loads must be applied 
many times in the life of the structure, it will 
generally be required that all material in the 
structure must remain in the elastic range at 
all times. For this reason, methods had to be 
developed for elastic design. 

In 1960, Joseph B. Keller published an 
article on column design (Ref. 9) which 
renewed interest in elastic, minimum weight 
design. Several papers have subsequently been 
published by Keller and his associates in 
which a class of eigenvalue problems is treated 
(Refs. 10,11,12). The methods employed in 
these papers are elegant but are not easily 
adapted to realistic engineering problems. 

A new method of optimal design has been 
developed by J.E. Taylor and W. Prager since 
1967 (Refs. 13,14,15). This method is based 
on an energy representation of the structural 
element under consideration. A particularly 
nice feature of the method is the ability to 
obtain sufficient conditions for certain classes 
of design problems. However, no unified 
method of constructing solutions has been 
presented. 

7-1.3 METHODS EMPLOYED 

The theorems of (Chapter 6 will be em¬ 
ployed here for the solution of optimal design 
problems. Use of the results of (Chapter 6 to 
construct solutions of optimal design prob¬ 
lems is called an indirect method of solution. 
This is so, because one first obtains a set of 


conditions that the solution of the optimal 
design problems must satisfy. Once this task is 
complete, the design problem is reduced to 
the determination of solutions of the neces¬ 
sary conditions that are candidate solutions of 
the optimal design problem. The term “in¬ 
direct” seems to describe this process quite 
well. 

As discussed in par. 6-5, any method of 
solving the nonlinear boundary-value problem 
contained within the necessary conditions is 
admissible. In this chapter, two problems will 
be solved by shooting techniques. The prob¬ 
lems of par. 7-2 are treated by the shooting 
technique of par. 6-5. The problems of par. 
7-3, however, are treated by a modified 
shooting technique. 

7-2 A MINIMUM WEIGHT COLUMN 

A lightweight column of length T is to be 
designed to support a given load P. The 
material is specified and has yield strength 
^max- particular support considered is 
shown in Fig. 7-1. In problems considered 



(A) Undeflected (B) Deflected 

Figure 7- 1. Column Under Consideration 

here, the cross section is assumed to depend 
on only one design variable, m(/), 0 < r < T. 
The problem is to determine u{t) that mini¬ 
mizes the weight or, equivalently, the volume 
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A[u{t)]dt 


and satisfies the conditions, 


d^x 

Em—^px = o, 


x{0) = 0,^iD = 0 

dt 


and 


ib =-— a <0 

^ A(u) 


(7-1) 


1 (7-2) 


(7-3) 


where 

ac(r) = lateral deflection of the column 
t = distance measured along the column 
A(u) = area of the cross section 


the optimal control problem considered in 
par. 6-4. 

For use in Theorem 6-7, construct 

P 1 

Aiu) "'"‘’‘J 

G = X|JC|(0) + X2JC2(70* 

Conditions, Eqs. 6-121 and 6-124, yield 

d\i bH ^ PXj 

dt bxi EI{u) 

d\2 _ bH , 

dt djcj "" ' 

Xa(0) = 0,Xi(r) = 0. 

The system, Eq. 7-5, reduces to 




I(u) = smallest moment of inertia of the 
area of the cross section about a centroidal 
axis. All cross sections are assumed to have 
two orthogonal axes of symmetry with P 
acting through their intersection. 


^ /^Xa 
dt^ ” EI(u) 

X2(0) = o.-^(r) = o. 

at 


I (7-6) 


By defining jc, = x and Xj = dx^ldt, Eq. 
7-2 reduces to the system 


_ -t 


dX2 _ Px^ ^ 

dt " EI{u) 


(7-4) 


(0) == 0, X2(T) - 0 . j 

The problem is thus reduced to the form of 


Eq. 7-6 for X 2 (/) is identical to Eq. 6-14 for 
x(/). Both problems are homogeneous, how¬ 
ever, so X 2 (/) and x{t) may differ by an 
arbitrary constant multiplier, say Xo, i.e., put 
Xaf^) = Xox(r). This problem is normal (Ref. 
17), so Xo 0 may be chosen as one. 


Condition, Eq. 6-128, of Theorem 6-7 is, in 
this case, yL{PIA{u) - (7^^^^^^^) =0. Two 
possibilities now exist; either ^ = 0, or PlA{u) 
~ ^max ” second case, u is just the 

algebraic solution of 
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(7-7) 


In the remaining case, ^ - 0 and condition, 
Eq. 6-122, of Theorem 6-7 is 

.0, 

bu bu E y bu j 

(7-8) 

The design variable u(t) is thus determined 
in subintervals of [0,n by either Eq. 7-7 or 
7-8. So that the results of the present method 
may be compared with those obtained by 
Keller (Ref. 9), choose A - u and I = otu^. 
This corresponds to having the geometric 
shape of the cross section fixed and allowing 
all dimensions to vary as . 


With this form of A{u) and /(u), Eqs. 7-7 
and 7-8 become 

= (7-9) 

and 


1 - 


2Px^ 

Eclu^ 


= 0. 


(7-10) 


Condition, Eq. 6-131, of Theorem 6-7 
requires that the expression for u be chosen 
which satisfies the constraint, Eq. 7-3, and 
makes H as laige as possible. This criterion 
yields the choice between 

or (^) ' x’"}?.!!) 

When Eq. 7-11 is substituted into Eq. 7-2, 



74 


x(0) = 0,-^(71 = 0 
at 

where the choice on the right side of Eq. 7-12 
must correspond to the selection in Eq. 7-11. 

This boundary-value problem is solved by 
an iterative method based on Newton’s 
Algorithm. The missing initial condition is 
taken as djc/dr(0) = C. Integration of the 
resulting initial value problem from 0 to T 
yields an error dxldt{T;C) in the final value. 

This notation is chosen to emphasize the 
dependence of x on the estimate C of the 
missing initial condition. The objective is to 
find C so that dxldt(TX^ = 0. Once C is 
found, the initial value problem for x(/) may 
be solved and uit) determined from Eq. 7-11. 

In order to employ Newton’s Algorithm, 
b/bC ldx/dt(T;Q] is needed. It is obtained 
by formally differentiating Eq. 7-12 with 
respect to C to obtain 

dt^ EaP I 



f(0) = 0.^(0) = 1 J 

where the choice on the right side of Eq. 7-13 
must correspond to the selection in Eq. 7-11. 
The order of taking derivatives has been 
changed and the notation f = bx/bC intro¬ 
duced in obtaining Eq. 7-13. 

The iterative method for determining C is 
then: 

Step 1. Make estimate C = C© 
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Step 2. Integrate the differential equation 
in Eq. 7-12 with 

(0) = Co, x(0) = 0, and Eq. 7-13. 
at 

Step 3. Make the adjustment in C 

Step 4. Return to Step 2 with new esti¬ 
mate C = Cl, and repeat. 

The equations derived here must be 
changed only slightly to solve column prob> 
lems with other end conditions and other 
forms of cross section. 

For a numerical example of this problem, 
let the cross section be circular with variable 
radius. In this case a = l/(4a‘). For the exam¬ 
ple, let = 20,000 psi, £ * 3x10^ psi, and 
r* 10 in. 


A FORTRAN program was written to 
perform the iterative procedure. The program 
was run on an IBM 360-65 Computer and 
required approximately 0.1 sec per iteration 
and only four to six iterations to converge. 

The results for this design problem are 
given in Table 7-1 and Fig. 7-2. For loads 
above 6794 lb, the cross-sectional area is 
determined \>y A ~ and the resulting 

column is stable. A meaningful optimal design 
problem then exists only for P < 6794. 

7-3 A MINIMUM WEIGHT STRUCTURE 
WITH ANGULAR DEFLECTION RE¬ 
QUIREMENTS 

7-3.1 STATEMENT OF THE PROBLEM 

The problem considered in this paragraph is 
the design of a portable communication tower 
of height L which will support a lineof-site 
transmission unit, a laser transmitter, for 
example. In order for the transmission beam 


TABLE 7-1 

RESULTS FOR COLUMN PROBLEM 


Voluffit of 

P, lb Volumt, in.^ Unfform Column', in. ^ Saving, % 


50 

0.260 

0.291 

10.6 

100 

0.361 

0.412 

12.4 

200 

0.507 

0.595 

14.7 

500 

0.806 

0.923 

12.7 

1000 

1.140 

1.300 

12.3 

1500 

1.408 

1.600 

11.9 

2000 

1.640 

1.840 

10.9 

3000 

2.048 

2.260 

9.3 

4000 

2.412 


7.3 

5000 

2.765 

2.910 

5.2 

6794 

3.397 

3.397 

0.0 


' Minimum Weight Uniform Column 
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P*50 P«100 i»*500 P- 1000 P-1500 



P-2000 P-3000 P-5000 P-6794 


Figure 7-2, Profiles of Optima! Columns 

to hit the receiving unit, the top of the tower 
on which the transmitter unit is mounted 
must undergo only a certain allowable rota- 
tion d when the tower is exposed to a given 
extreme uniform wind load Q pounds per unit 
length of tower. 

It is required that the tower be as light¬ 
weight as possible so that it may be trans¬ 
ported and erected without the aid of heavy 
machinery. For this reason, the design crite¬ 
rion is minimum weight. However, one addi¬ 
tional requirement must be placed an the 
tower. In transportation and election it must 
be strong enough so that it is not damaged by 
rough treatment. Therefore, it is required that 
the moment of inertia of the cross-sectional 
area of the tower be greater than a prede¬ 
termined limit Iq everywhere. 

7-6 


The general configuration of the tower is 
shown in Fig. 7-3. Three vertical members 
with cross-sectional area A(t) are arranged on 
the vertices of an equilateral triangle of 
altitude h(t). Here / is a coordinate measured 
along the length of the tower. In order to 
maintain the spacing of the vertical elements, 
smaU cross members are inserted. 



Figure 7-3. To\^ Considered 


It is assumed that the tower is constructed 
of a given material with density p. Further, it 
is assumed that jS/t cubic units of material are 
required per unit height of tower in order to 
maintain the spacing of the vertical elements. 
The coefficient P is to be determined from 
design experience. For this configuration the 
total weight of the tower is 

H/= I 3plAit)^phit)]dt. 

Jo 

Since 3p is a constant, W is minimized if and 
only if 

K= I M(r) +13/1(01 d/ (7-14) 

Jo 

is minimum. The objective in the design 
problem is to choose i4(/) and h{t) for 0 < / < 
I, so that V is as small as possible and the 
given conditions are met. 

Lateral deflection of the tower due to the 
lateral wind load is determined by elementary 
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beam theory. The differential equation for 
displacement is (Ref. 14) 

= - Af (0 (7-15) 

where 

E = Young’s modulus of the material 

/(/) = minimum moment of inertia of 
the cross-sectional area of the 
tower 

X = lateral displacement 



M{t) = bending moment of tower. 

The moment of inertia of the cross section is 

I /(0 = -X(0(/»(01*. (7-16) 

In order to prevent damage in handling, it is 
required that 

liO > /o > 0, 0< t < L. 

or in the notation of par. 6-4, 

0 = /o 0, 0< / < £ 

If 1(0 = Iq gives a tower with angular 
deflection less than or equal to d at the top, 
then this is the optimal tower and no further 
work is required. On the other hand, if this 
tower has angular deflection greater than d, 
then the tower is not admissible and it is 
required that the angular displacement is 
equal to 6. This is the only situation con¬ 
sidered here. 

In order to fit this problem into the form 


considered in par. 6-4, define 

JCi =JC 

xi = x\ 

with this notation and that of par. 6-4, the 
second-order equation 7-2 is equivalent to 

x\=^X2=fi \ 



The design problem will now be solved for 
two admissible configurations of the tower. 

In order to compare results obtained for 
the various configurations considered, a tower 
with properties of Table 7-2 will be treated. 


TABLE 7-2 
CONSTANTS 

Z. = 360 in. e ^ 0.0001 rad 

Q * 8.35 Ib/in. /? - 0.25 in. 

£ * 3x 10^ Ib/in. /o * 172.8 in.^ 


7-3.2 TOWER WITH ONE DESIGN VARI¬ 
ABLE 

For the problems considered in this para¬ 
graph, A(t) wiU be held constant with the 
value of A. Two ways of mounting the tower 
on the earth will be chosen. The first method 
will be to fix the base of the tower rigidly to 
the earth and leave the top unsupported. The 
second method will be to pin the lower end to 
the earth and support the top with guy lines. 
Since A is constant 


I 
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( 


’■ \ 


+ h(t)]dt 


1 ‘ 


= ^I + j3 I hiOdt. 


Minimization of F in this case is equivalent 
to minimization of 


- r 




(7-18) 


In the notation of par 6-4, 
fo=h(t). 


par 7-3.1. Eqs. 7-17 become 
x'i=X2 = fi 

. 3Q(L - ty ^ . 

” 4EAh^ 

j 

with boundary conditions 
x,(0) = 0 
x,(0) = 0 

Xi{L) = e J 


(7-20) 


(7-21) 


The problem stated here will now be solved 
using Theorem 6-7. Define 


7-3.2.1 METHOD 1. TOWER WITH BASE 
RIGIDLY FASTENED TO THE 
EARTH 

The tower considered here is shown in Fig. 
7-4. The bending moment Af(t) due to the 



wind load Q is 

mo = -j(l - 0^ (7-19) 

where t is measured upward from the bottom 
of the tower; the other symbols are defined in 

7-8 



and 

= 7i 1 (0) + 72 ^2(0) + 73 ^2 ^ 1 • 


The conditions of Theorem 6-7 yield 


X', =0 


■ (7-22) 

Xi = - X, 



0 = - Xo - Xj 

r 6Q(L - 0"' 

L 2EAh^ . 


4 

+ ti—Ah 


(7-23) 

X,(L) = 0. 


(7-24) 


The general solution of Eq. 7-22 is 

I 
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X|(0 = ti 


Since Cases 1 and 2 cover all possibilities. 


X2(0 = -t2 -iit. 

Condition, Eq. 7-24, implies = 0, so 
Xi(O = 0 


(7-25) 


X2(0 = -t2. 

For the determination of hit), two cases 
must be considered: 

Case 1: 0 = 0. In this case 



(7-27) 


where by Eq. 6-131, the choice in Eq. 7-27 
must be made which makes 




(?(!-/)* 

4EAh'^ 


-/o=0 



Case 2: 0 < 0. In this case Eq. 6-128 is 


a maximum. 

Let t* be a point of transition from one 
expression in Eq. 7-27 to the other. Since 
Theorem 6-7 requires H to be continuous, 
when the two expressions of Eq. 7-27 evalu¬ 
ated at are substituted into H, a common 
value must occur. 

Evaluate ^ as a function of both and 
^1- 


and since 0 # 0, /x = 0. Substituting this result 
into Eq. 7-23, 


-Xo 


, 6i2Q(L-t)^ 
2EAh^ 


= 0. 


(7-26) 


= =-hi 


1 pt, (?(£-/)» ! 1 

2 1 2EA \ h\ 


If Xo = 0, then, = 0 and from Eq. 7-26, 
Xi = X, = 0. This, however, violates the 
condition stated in the first sentence of 
Theorem 6-7. Therefore, X© ^ 0 and it is 
permissible to choose Xo = 2. 

Further, since Xo > 0, (2 > 0 in order that 
Eq. 7-26 hold. Eq. 7-26 then yields 





i rataQd-oO 1 
2 L 2£yl \h\ 




L 2£^ J 


h/,2 



I 
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Figure 7-2, Profiles of Optima! Columns 

to hit the receiving unit, the top of the tower 
on which the transmitter unit is mounted 
must undergo only a certain allowable rota- 
tion d when the tower is exposed to a given 
extreme uniform wind load Q pounds per unit 
length of tower. 

It is required that the tower be as light¬ 
weight as possible so that it may be trans¬ 
ported and erected without the aid of heavy 
machinery. For this reason, the design crite¬ 
rion is minimum weight. However, one addi¬ 
tional requirement must be placed on the 
tower. In transportation and election it must 
be strong enough so that it is not damaged by 
rough treatment. Therefore, it is required that 
the moment of inertia of the cross-sectional 
area of the tower be greater than a prede¬ 
termined limit Iq everywhere. 

7-6 


The general configuration of the tower is 
shown in Fig. 7-3. Three vertical members 
with cross-sectional area A{t) are arranged on 
the vertices of an equilateral triangle of 
altitude h{t). Here / is a coordinate measured 
along the length of the tower. In order to 
maintain the spacing of the vertical elements, 
smaU cross members are inserted. 



Figure 7-3 Tower Considered 


It is assumed that the tower is constructed 
of a given material with density p. Further, it 
is assumed that jS/t cubic units of material are 
required per unit height of tower in order to 
maintain the spacing of the vertical elements. 
The coefficient is to be determined from 
design experience. For this configuration the 
total weight of the tower is 

H'= I 3p[A(t)-^mt)\dL 

Jo 

Since 3p is a constant, W is minimized if and 
only if 

K= I (7-14) 

Jo 

is minimum. The objective in the design 
problem is to choose i4(/) and h{t) for 0 < r < 
I, so that V is as small as possible and the 
given conditions are met. 

Lateral deflection of the tower due to the 
lateral wind load is determined by elementary 
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beam theory. The differential equation for considered in par. 6-4, define 
displacement is (Ref. 14) 

Xi =JC 

£/(/)x'' = -Af(t) (7-15) 

X 2 = x\ 

where 

with this notation and that of par. 6-4, the 

E = Young’s modulus of the material second-order equation 7-2 is equivalent to 


/(r) = minimum moment of inertia of 

the cross-sectional area of the 
tower 

jc = lateral displacement 



x\ =/i 



(7-17) 


The design problem will now be solved for 
two admissible configurations of the tower. 


M(t) = bending moment of tower. 

The moment of inertia of the cross section is 


In order to compare results obtained for 
the various configurations considered, a tower 
with properties of Table 7-2 will be treated. 


I = (7-16) 

In order to prevent damage in handling, it is 
required that 

I(t) > /o > 0, 0 < / < I. 

or in the notation of par. 6-4, 

0 = /o - /(/) < 0, 0 < / < I. 

If /(/) = /q gives a tower with angular 
deflection less than or equal to d at the top, 
then this is the optimal tower and no further 
work is required. On the other hand, if this 
tower has angular deflection greater than 9, 
then the tower is not admissible and it is 
required that the angular displacement is 
equal to 6 . This is the only situation con¬ 
sidered here. 

In order to fit this problem into the form 


TABLE 7-2 
CONSTANTS 

Z. * 360 in. e ^ 0.0001 rad 

Q = 8.35 Ib/in. p * 0.25 in. 

£ « 3x 10^ Ib/in. /o = 172.8 in.^ 


7-3.2 TOWER WITH ONE DESIGN VARI¬ 
ABLE 

For the problems considered in this para¬ 
graph, Ait) will be held constant with the 
value of A . Two ways of mounting the tower 
on the earth will be chosen. The first method 
will be to fix the base of the tower rigidly to 
the earth and leave the top unsupported. The 
second method will be to pin the lower end to 
the earth and support the top with guy lines. 
Since y4 is constant 
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y= [A 

J 0 


+ h(t)]dt 


r 


= yll+j3 I hiOdt 


Minimization of F in this case is equivalent 
to minimization of 


J= I hit)dt 

Jo 

In the notation of par 6-4, 


(7-18) 


par 7-3.1. Eqs. 7-17 become 


x\ 


= fi 


. 3QiL -/)*_. 

" 4EAh^ - 

with boundary conditions 
x,(0) = 0 
x,(0) = 0 
x,(I) = d 


(7-20) 


(7-21) 


The problem stated here will now be solved 
using Theorem 6-7. Define 


7-3.2.1 METHOD 1. TOWER WITH BASE 
RIGIDLY FASTENED TO THE 
EARTH 

The tower considered here is shown in Fig. 
7-4. The bending moment MU) due to the 



wind load Q is 

^(/) = -Y(I-t)* (7-19) 

where t is measured upward from the bottom 
of the tower; the other symbols are defined in 

7-8 



and 


(? = 7i^1 (0) + 72^2(0) + 73 IXiiL) -0]. 

The conditions of Theorem 6-7 yield 


X', =0 

Xi=-X, 

0=.^=-Xo-X, 

r 6Q{L - 0*1 

(7-22) 

dh ® * 

L 2EAh^ J 


4 

+ 11—Ah 


(7-23) 

X,(L) = 0. 


(7-24) 


The general solution of Eq. 7-22 is 
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^i(0 “ 


Since Cases 1 and 2 cover all possibilities, 


Condition, Eq. 7-24, implies fi = 0, so 


X,(r) = 0 


(7-25) 


X2(/) = -t2. 

For the determination of hit), two cases 
must be considered: 

Case 1: 0 = 0. In this case 



where by Eq. 6-131, the choice in Eq. 7-27 
must be made which makes 


H=-h- 


3i2QiL-t)^ 

4EAh^ 



so 


/I* 



= /ii 


a maximum. 

Let t* be a point of transition from one 
expression in Eq. 7-27 to the other. Since 
Theorem 6-7 requires H to be continuous, 
when the two expressions of Eq. 7-27 evalu¬ 
ated at are substituted into H, a common 
value must occur. 


Case 2: 0 < 0. In this case Eq. 6-128 is 


/10 = O 


Evaluate ^ as a function of both and 

hi. 


and since 0 # 0, p = 0. Substituting this result 
into Eq. 7-23, 


— Xo + 


6i2QiL-t)^ 

lEAh^ 


= 0. 


(7-26) 


If Xo "= (^» then, ^2 = (^ and from Eq. 7-26, 
Xi = Xj = 0. This, however, violates the 
condition stated in the first sentence of 
Theorem 6-7. Therefore, Xo ^ 0 and it is 
permissible to choose Xo = 2. 


Further, since Xo > 0, {2 > 0 in order that 
Eq. 7-26 hold. Eq. 7-26 then yields 


/i = 


\^UQ{L - 




= =-A, 


I pbOa-O^ I 1 

2L 2EA ] h\ 



H(h2) = H2=‘-h2 


1 [ 3 ^ 6 ( 1 - 0^ 1 1 

2 L 2EA \h\ 
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Then 


H2 -Hi --y/li 


= ^( 2^1 ^h2)ihi -h2?. 

If h 2 < hi at any point /, then /12 violates 
the constraint and /i = /ii. If /12 = at any ?, 
there are no alternative choices for/t. If /t 2 > 
hi at any point /, the choice h - hi must 
maximize//(/); this implies 7/2 -//i < 0. But 
this is impossible from the above because h | 
and /22 are always non-negative. Therefore, if 
h 2 > hi, then it is required that /z = /I 2 • From 
this, it is concluded that 


form for 0 < t < t*. The problem is now to 
determine {2 • 


The condition which has not yet been 
satisfied is X 2 {L) = d. Substituting Eq. 7-27 
into Eq. 7-20 yields 





^2 


= < 


ioT 0 < t < t* 


QiL - t? 

\ 2£/o 


iox t* < t < L 


(7-29) 


h(t) = maxlhi(t),h2(t)] . 

Since h(t) is defined as the maximum of two 
continuous functions, h is continuous. It 
follows that all points t* of transition from 
one value of Eq, 7-27 to another can be found 
by equating the two expressions of h(t). 


The point is then determined by 
1/2 

/ \ 

h. 




_ \^UQiL - 

V 2EA J 


hiit*). 


This solution yields two values of The 
requirement 0 < < L results in a unique 

value of t*. 



For 0 < t < L, the first form of h in Eq. 
7-27 is monotone decreasing and is zero at / = 
L. It is, therefore, clear that the second form 
of h must hold for < r < L and the first 

7-10 


with X 2 ( 0 ) = 0. 

Integrating Eq. 7-29 first from 0 to (as 
given by Eq. 7-28) and then from to L 
yields 


X2{L)^ 


3L 


5/3 


10 

1 / 25 / 123/4 

iQ 


\2EA/ 




X 3 + 10/o^®)^-3/2 


(24/"* 


(7-50) 


Eq. 7-30 is solved numerically for { 2 - Once 
^2 is determined, then Eqs. 7-27 and 7-28 
completely specify the tower. Results are 
shown in Table 7-3(A) and Fig. 7-5(A). 


This design problem has been solved analyt¬ 
ically. As will become apparent as more 
realistic problems are treated, one should not 
expect to obtain solutions in this way. In 
most problems, numerical methods must be 
applied to solve the differential equations 
arising in the Theorem 6-7. 
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TOWERS, ONE DESIGN VARIABLE TOWERS, ONE DESIGN VARIABLE 
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Figure 7-5. Tower With Base Rigidiy Fastened 
to the Earth 


7-3.2.2 METHOD 2. TOWER WITH BASE 
PINNED TO EARTH AND WITH 
TOP SUPPORTED BY GUY LINES 

The tower considered here is shown in Fig. 
7-6. It is convenient here to locate the 
coordinate system at the top of the tower. 
The bending moment generated by the uni¬ 
form wind load Q is M - - Q/l t(t - I) so 
the differential equation for bending is 

EI(h)x" = ^ t{t-L). 

Define Xi - x and X 2 = x \; this is equiva- 
lent to 


xl s/, 


Qt(t-L) _ 
* 2EI 


(7-31) 


The boundary conditions in this case are 

x,(0) = 0 

^i(O) = e } (7-32) 

Xi(L) = 0 


The quantity to be minimized is still given 
by Eq. 6-17. In the problem considered here, 

-\oh-g(Io -^Ah^) 

C = 7iX,(0)+7jU2(0)-d) +73X,(I). 

Conditions of Theorem 6-7 are 


^ dH 
dt ~ dx 


dt 


dH 

dX2 




so 



Figure 7-6. Tower With Guy Lines 


^2(0 ~ “ tif • 

Also, 

X2a) = 0 

This equation implies {2 " — ti so 
Xi=ti 


7-12 
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X2=«i(I-0. 


Two cases must now be considered: 
Case 1. 0 ~ 0. In this case 

-/o =0 


SO 


-(i^r 


(7-33) 


Case 2. 0 < 0. In this case /i - 0. 
Substituting into bHIhh = 0, 




it(L-t)6Qt(t-L) 


2EAh^ 


= 0. 


As in the previous case Xo # 0 so it is 
permissible to put Xo == 2 and obtain 


2EA 


(7-34) 


Eqs. 7-33 and 7-34 along with Eq. 6-131 
yield 


h(t) 


■Ktr - 

I 


(7-35) 


the choice in Eq. 7-35 being made which 
makes 


H^-h- 


4EAh^ 


largest. 


The problem Is thus solved when is 
determined. In this case, analytical integration 


of Eq. 7-31 and solution of the boundary 
conditions, Eq. 7-32, for are not feasible. 
Therefore, the shooting technique of par. 6-5 
is employed. Numerical results for this prob¬ 
lem are given in Fig. 7-7(A) and Table 7-3(B). 



(A) One Control 
Variable 


(B) Two Control 
Variables 


Figure 7-7. Tower With Base Simply Supported 
and Top Supported With Guy Lines 


The modification of Newton’s Algorithm 
consists of using a correction of at most 10% 
of the values of the unknown iteration param¬ 
eters. It has been found in particular problems 
that where the Newton Method fails to 
converge, this method will converge. The rate 
of convergence, particularly for the first few 
iterations, is slowed by the modification, 
however. 

7-3.3 TOWER WITH TWO DESIGN VARI¬ 
ABLES 

The same two methods of supporting the 
towers will again be considered. Here, how¬ 
ever, both A and h will be allowed to vary 

7-13 
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along the tower and play the role of design is permissible to take Xq = 1. The system is 
variables. then two equations for h and A with solution 


Eqs. 7-16 and 7-17 remain the same in the 
present problem. However, from Eq. 7-14 

/o=^(/) +0/1(0. 

7-3.3.1 METHOD 1. TOWER WITH BASE 
RIGIDLY FASTENED TO THE 
EARTH 

Fig. 7-4 applies and Eqs. 7-19, 7-20, and 
7-21 hold in this problem. Substitution into 
Eq. 6-108 yields 



The equations for X| and X 2 are just as in the 
preceding work, so again, 

Xi(0 = 0 

XjCO ^ — ^2 • 

Since h plays the role of Ui and A the role of 
U 2 y Eq. 6-122 is: 


dh 


= - Xo0 + 


€2 36(1 - 0 ^ 

2EAh^ 


+ ^Ah = 0 


bA ® ^EAh 



= 0 . 


(7-36) 

As before, two cases must be considered: 

Case 1. 0 < 0. This implies fi ~ 0. From 
Eq. 7-36, it is clear that Xq = 0 implies £ 2=0 
which contradicts Theorem 6-7. Therefore, it 

7-14 


1(7-37) 

.< 0 - (i!^f I 


Case 2. 0 = 0. This is 

jAh^ =/o 


(7-38) 


Eq. 7-38 along with Eq. 7-36 is a system of 
three equations in h, Ay and /i and solving for 
h and A yields 



The design variables are chosen by Eqs. 
7-37 and 7-39, depending on which makes H 
largest. The problem with differential equa¬ 
tions, Eq. 7-20, and boundary conditions, Eq. 
7-21, is now treated by the shooting tech¬ 
nique of par. 6-5. Numerical results are given 
in Table 7-3(C). 


7-3.3.2 METHOD 2. TOWER WITH BASE 
PINNED TO EARTH AND WITH 
TOP SUPPORTED BY GUY LINES 


Fig. 7-6 applies and Eqs. 7-31 and 7-32 
hold for this problem. Substituting into Eq. 
6-108 yeilds 


H-\xX2 +X 2 



-Xo(A+0/i)- 
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The equations for X] and Xj are just as in 
the preceding case, so again 

“ ti 


Xa=ti(I-0 


Eqs. 6-122 for the design variables are 


— 

dh 


= -Xo|3 + 


ii3Qt(L-ty 

2EAh^ 


\ 


+ yi^Ah = 0 

i,3Qt(L-ty 
dA ® 4EA^h^ 


\ (7^0) 


+ M3 A* =0. 


As before, two cases must be considered: 


Gise 1. ^ < 0. This implies /i = 0. 

From Eq. 7-40, it is clear that if Xq = 0, then 
Ji = 0, and Xj = X 2 = 0. This contradicts the 
theorem, so Xq ^ 0 and it is permissible to 
take Xo = 1. The system is then a set of two 
equations for h and A which yields 

m = a - 0*/^ I 

>(7-41) 


Case 2. 0 = 0. This is 

■|^A^=/o. (7^2) 

Eq. 7-42 along with Eq. 7-40 is a system of 
three equations for K A, and ti. Eliminating /i 
and solving for h and A yields 




I (7-43) 


The design variables are chosen as in Eqs. 
7-41 or 7-43, depending on which makes H 
largest. The problem with differential equa¬ 
tions and boundary conditions, Eq. 7-20, is 
now solved by the shooting technique. Nu¬ 
merical results are given in Table 7-3(C). 


7-3.4 DISCUSSION OF RESULTS 

For both types of tower considered (simply 
supported and towers with top supported by 
guy lines), the variables A and h could be 
fixed at constant values, large enough that 
deflection requirements are met. For a given 
configuration of the tower, there is one pair 
of constant values A and h which yield a 
tower at least as light as any other combina¬ 
tion of constant A and h. For both types of 
tower, finding these values is a matter of 
simple algebra. Results for both towers are 
given in Table 7-3(C), referred to as towers 
with no design variables. With A held constant 
and h(t) treated as a design variable, the 
problem can be solved for several different 
values of A for both types of tower. Sum¬ 
maries of these solutions are given in Table 
7-3(A) and Table 7-3(B). If the tower weights 
are then plotted as a function of fixed values 
of i4, a minimum, or best, weight can be foimd 
(Table 7-3(C)). These optimum towers show a 
reduction in weight over the no design vari¬ 
able case of 13.5% and 13% for the simply 
supported and guy-line supported towers, 
respectively. Results for solutions of the 
problem when both44(/) and h{t) are treated 
as design variables are also given in Table 
7-3(C). These represent reductions in weight, 
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over the case where only the spacing h{t) is 
allowed to vary, of 6.1% and 6.7% for the 
simply supported and guy-line supported 
towers, respectively. Similarly, these towers 
represent respective reductions in weight over 
the completely uniform (no design variables) 
tower of 18.8% and 19%. 

All four configurations of the structure 
described in the preceding subparagraphs have 
been successfully prescribed by a digital com¬ 
puter approach. An IBM 360-65 Computer 
was used and the programs employed Runge- 
Kutta integration with the Newton’s Method 
described in the text. Convergence depended 
on getting a good starting value for the 
multiplier This was made more difficult by 
the fact that i has no physical significance so 
that, consequently, engineering intuition was 
no help. To fmd a sufficiently close starting 
value for f, (i.e., a value for which the error is 
of order two or less) several different values 
of t were investigated, each increasing from 
the previous value by a factor of 10, and the 
first value very close to zero. Once a starting 
value that would allow Newton’s Method 
iterate effectively was found, convergence 
occurred in ten or less iterations, taking less 
than two minutes of computer time. This 
time could be reduced with increased sophisti¬ 
cation of the computer program. 

Other optimal design problems can be 
approached with the method of this para¬ 
graph. Also, different parameters could be 
treated as variables. For instance, several 
materials of different densities and stiffness 
characteristics could be used in the same 
structure, and the choice could be left as a 
design parameter. The loading could be a 
function of the height above the ground 
rather than be constant. Other restrictions 
might also be imposed; maximum width or 

7-16 


maximum deflection at a given point could be 
among these. 

Note that Figs. 7-5 and 7-6 arc not scale 
drawings of the towers, but are representative 
of the general shape of the respective towers, 
as viewed on one face. Sample profiles of the 
four possible structures are presented in Table 
7-3(A), (B), and (C). 

7-4 MINIMUM WEIGHT DESIGN OF 
BEAMS WITH INEQUALITY CON¬ 
STRAINTS ON STRESS AND DEFLEC¬ 
TION 

The problems treated thus far in this 
chapter have only design variable inequality 
constraints. In engineering design one often 
encounters problems in which it is required 
that the state of the system satisfies inequal¬ 
ity constraints. As seen in (Chapter 6, state 
variable constraints are more tedious to treat 
and have features not encountered in prob¬ 
lems without state constraints. A class of 
beam design problems including state variable 
constraints is presented in this paragraph to 
illustrate some of the features and difficulties 
that can arise in this difficult class of prob¬ 
lems. While the problems solved are of limited 
practical value, they do illustrate typical 
features that can arise in state variable con¬ 
strained problems. 

7-4.1 STATEMENT OF THE PROBLEM 

Beams which are loaded in a general way 
(such as in Fig. 7-8) are considered in this 
paragraph. The cross sections of the beams are 
assumed to depend on a vector parameter u(t) 
= [mi(0 . W 2 ( 0 , .... “mWl and to be sym¬ 
metric with respect to vertical and horizontal 
axes. The vertical axis of symmetry is as¬ 
sumed to lie in the plane of loading. The 
beams are made of a homogeneous, isotropic. 
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Figure 7-8. Beam Loaded in a General Way 


linearly elastic material with Young’s modulus 
E. Small deflection, elementary beam theory 
is used throughout this paragraph. Also, the 
effect of the weight of the beam on deflection 
is neglected. 

Since, for a particular beam, the cross 
section is determined by u(/), 

A{t)-A [M(r)] (cross-sectional area), 

(7-44) 

/(/) -1 [u{t)] (moment of inertia), 

(7-45) 

b{t,d) = b[u{t)4] (width of cross section 
at d). 


• (7-46) 

Q{t,d) = Q{u{t)4\ (first moment of area 
above d, about the 
neutral axis), 


(7-47) 


and 

0(0 = C[u{t)] (half-depth of beam), 

(7-48) 


where d is the distance above the neutral axis 
of the cross section. 

For each /, let d^ [/,w(0,Af(0,K(01. i = 1 
and 2, be the distances from the neutral axis 
where 


^2 


2 L 

m J 

+1. 


^2 

[ /'(/) 


/K"(0QMf,di(0] \ 


and 



MHOdlU) 

IHO 


( VH0Q^[t42U)] \ 

\ nU)bHt42(0]/ 


1/2 


respectively, are the maximum principal 
stresses that occur in the cross section at t. 
The values d^ and d 2 may be determined by 
the methods of ordinary calculus, for each t. 


The problem is to determine u(t) sc that 
the beam, subjected to a given loading, 
contains as little material as possible and still 
satisfies the following conditions: 

1 . Principal normal stress is less than or 
equal to some allowable normal stress 
a 

max 

2. Principal shear stress is less than or 
equal to some allowable shear stress 

ax * 

3. Stiffness is bounded away from zero 
(otherwise an infinitesimal change in 
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load can cause the deflection to be 
discontinuous). 

4. Beam deflection at each point is 
bounded by two given functions A'j (/) 
and i.e., Xi(t) < x{t) < X 2 (t) 

with A'lf/) < ATjCr). 

In structural design problems, it is fre¬ 
quently sufficient to require only that the 
maximum flexural stress be less than 

m ax 

and the maximum direct shear stress be less 
than These conditions are considerably 
easier to enforce than the conditions on 
maximum principal stress. 

If the beam is subjected to several loadings, 
then the problem is more tedious but is no 
more difficult mathematically. Corresponding 
to each loading there is a a deflection curve, 
bending stress, and shear stress that must 
satisfy the stated conditions. 

Further, since the beams are made of 
homogeneous material, the weight of a beam 
will be minimum if and only if its volume is 
minimum. Therefore, in the following the 
quantity to be minimized will be volume. 

The given problem is now stated mathe¬ 
matically: A vector function u(t) is sought 
which causes the functional 

/= I Alu(t)]dt (7-49) 

^ 0 

to be a minimuin subject to the following 
conditions: 


dr* 


E/MOl 


d*x 

dr* 


= <7(0 


(7-50) 


at all but a fuiite number of points in (0, T), 


7-18 


where ^(r) is distributed load; 

g,[x<'>(0), x«)(nj = 0. s = 1. B. i < 3; 

(7-51) 

Op(0 < <r„„ (7-52) 

for all r in (0,7), where M(t) is bending 
moment; 

(7-53) 

for all / in (0,7) where V(t) is shear; 

/[w(01 > lo , (7-54) 

where /o is a constant greater than zero; and 
for all t in (0,7), 

X,{t) < x(t)< X^it), il’SS) 

It is assumed that the functions appearing 
above have the following properties: 

1 . q has a piecewise continuous derivative 
in (0,7). 

2. A, /, C, and Q are piecewise twice 
continuously differentiable. 

3. Xx and X 2 have continuous second 
derivatives in (0,7). 

A solution is sought with the following 

properties: 

1 . u^{t\ / = 1, ...,m, are piecewise continu¬ 
ous in (0,7). 

2 . x(/) is piecewise four times continuous¬ 
ly differentiable in (0,7). 

In case only maximum bending stress and 
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maximum direct shear stress are to be 
bounded by 0 ^^^^ and , respectively, 
constraints, Eqs. 7-52 and 7-53, are replaced 
by 


_ mt)\ C[uit)] 
/[m(01 


< 


(7-56) 


and 


^ \V(t)\Qlu(thd^(t)] 
“/[«i(016[«(0.d,(01 ^ 


(7-57) 


for all / in (0,7) where d^it) = d^ [/,m(/), 
Af(/), V(t)] is the distance from the neutral 
axis where the absolute value of the direct 
shear stress is largest. The distance d^{t) may 
be determined by the methods of ordinary 
calculus. 


In the case of beam design with multiple 
loading requirements, there is still just one 
design variable u(t). However, c rresponding 
to each loading there is an additional state 
variable (deflection) that must satisfy condi¬ 
tions identical in form to Eqs. 7-50 through 
7-55. The problem is to determine «(/) so that 
the functional, Eq. 7-49, is minimum subject 
to the condition that Eqs. 7-50 through 7-55 
are satisfied for each loading. 


7^.2 NECESSARY CONDITIONS FOR THE 
BEAM DESIGN PROBLEM 

The treatment which follows applies only 
to statically determinate beams, i.e., beams 
loaded in such a way that reactions at all 
supports (and hence also shear and bending 
moment) are determined completely by the 
conditions for equilibrium of the beam. 
Changes in formulation of the problem which 
are necessary to consider statically indeter¬ 
minate problems will be indicated below. 


In the statically determinate case, the 
differential equation, Eq. 7-50, and the 
boundary conditions, Eq. 7-51, reduce to 


d^x M{t) 
dt^ ^~EI(u) 


(7-58) 


and 


g, [x(0),x'(0),x(D,x'(01 = 0, 

5=1,2. (7-59) 


The boundary-value problem, Eqs. 7-58 
and 7-59, is equivalent to a boundary-value 
problem with a system of first-order equa¬ 
tions. The new problem may be written as 



dX2 

nr 


Mjt) 
mu) ’ 


(7-60) 

(7-61) 


and 


g,Ui(0),x,(0),Xi(n,X2(Dl =0, 


5=1,2 (7-62) 

where Xi is defined to be x. In terms of this 
notation, Eq. 7-57 becomes 

Xi(t)<xAt)<X2{tl 

It is now clear that the beam design 
problem is contained in the class of problems 
to which par. 6-4 applies. The quantities 
appearing in par. 6-4 will now be identified 
with the physical quantities associated with 
beam so that necessary conditions for the 
beam design problem may be stated. 

Conditions, Eqs. 7-52, 7-53, and 7-54, 
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correspond to Eqs. 6-101; Eq. 7-55 corre¬ 
sponds to two restrictions of the type ex¬ 
pressed by Eq. 6-132. The differential equa¬ 
tions, Eqs. 7-60 and 7-61, correspond to Eqs. 


6-97, where 


A =X2 

(7-63) 

1 

II 

(7-64) 

and in Eq. 6-96 


II 

(7-65) 


The ends of the beam are located at the 
known points / = 0 and t - T. Therefore, 
and in the general variational problem are 
known. Also, boundary conditions will 
generally be separated; i.e., some conditions 
will be given at 0 and others at T. 

The state variable constraints, Eq. 7-55, are 
of second-order since 

is not an explicit function of u, but 


= ^(0 + 


Em 


is an explicit function of u. The same argu¬ 
ment holds for Xi — Xi > 0. 

In terms of the notation of Eq. 6-101, 






7-20 


=/o -/(«)< 0 

=> 1(0 - YjiO < 0 


0 

( ^4. ^4 < 0 

mo 


^4,2 “ 


(7-66) 


EI(u) 


-Xl(0,<l>*=0 


and 


♦ 5,1 = ■ 


♦$>^5 < 0 

M(0 


(7-67) 


Em 


+ ^( 0.«5 =0 


Since the only explicit dependence of Eqs. 
7-52, 7-53, 7-54, 7-63, 7-64, and 7-65 on t is 
through M(t) and V(t), the points of discon¬ 
tinuity of functions (t^ in par. 6-4) arc 
denoted , including points of discontinuity 
of Af(t) or V(0; i.e., the correspond to 
points of application of concentrated loads or 
moments. Therefore, the are known. 

At each point the deflection curve is 
tangent to one of the curves 


X, = 2 r,(r) 


(7-68) 


or 


X, =X2(0 (7-69) 

and there is a neighborhood of in which 
this is the only point of tangency. 

At each r", the deflection curve becomes 

tangent to one of the curves, Eqs. 7-68 or 

7-69. Further, there is a neighborhood of 

in which Eq. 7-55 is a strict inequality to the 

left of rr, and Eq. 7-68 or Eq. 7-69 holds to 
6 
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the right of rj. At the corresponding point rj, 
the deflection curve leaves Eq. 7-68 or Eq. 
7-69. Eq. 7-68 or Eq. 7-69 then holds in (/“, 
rp, and Eq. 7-55 is again a strict inequality 
immediately to the right of t *. 

If Xi(0) or JCi(r) is not fixed by the 
boundary conditions of Eq. 7-62, then one 
part of Eq. 7-55 may be an equality at 0 or T. 
In this case 03 of Eqs. 7-66 and 7-67 need not 
be zero. 


dxi 

It 


= ^2 


(7-71) 


and 


dx2 

dt 


M(t) 

Em 


(7-72) 


at all but a finite number of points in (0,7); 

Xo> 0 


The t* are points where one or more of 
Eqs. 7-52, 7-53, and 7-54 changes from strict 
inequality to equality. Condition, Eq. 6-105, 
is assumed to hold at these points. 


According to Eqs. 6-106, 6-108 


-Ml 01 -M 2 02 - Ms 03 
— M4 04,2 Ms 05,2 


G = 


B 

S 

1 = 1 


tt 


(^"^0.4.6 04(^6)+^1,4,5 04(^5) 


>(7-70) 


■*“ ^0.5.6 05 (^ 5 ) ■•‘^1,5,6 05 (fp 


■*“ ^0,4.r 04(f;.) ■*“ ^l,4,r 04(^;.) I 

+ To.s/ 05 (^) + rl^s.r 05 (0 / 

Theorem 6-9 and Eq. 7-70 yield Theorem 
7-1. 


Theorem 7-1: Necessary conditions for the 
minimum weight beam problem are: 


Xi=fi 


X 2 = ^2 "" tl ^ 


and 

Xj + Xj + > 0 

for all / in (0,7), where fi and {2 tnay have 
different values in subintervals of (0,7) which 
are bounded by the t^ and rJ; 


— Xf 


bAju) 

bu. 




\ 


b 

-■r“ 1^101 +M202 +M303 

bUj 


>(7-73) 


+ M 4 04,2 Ms 05, 2 1 “ 0, / = 1,..., m,J 


M/0,' = 0,1 = 1, 2, 3, (7-74) 


and 

M4 04,2 “Ms 05,2 " (^ (7-75) 

at all but a finite number of points in (0,7); 
\(co^+0)-\(co„-0) = 0, 

1=1,2 (7-76) 
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^1 (ff + 0) — Xi (t^ — 0) — ^04, 

+ T;5r = 0 (7-77) 

Ur +0)-Xi Ur - 0) - Tm, 

+ ^,,=0 (7-78) 

Xl U{ + 0) — X| Uf — 0) — To4i 
+ T5,*=0 (7-79) 

Xj (/j 0) — Xj (^j ~ 0) — T] 4 j 

+ tT55 =0 (7-80) 

^(6„+0)-W(co„-0) + 7,^„=0 
HUr + 0) - ff(t^ _ 0) - To 4 A-i Ur') 

— Ti 4 Aj(r^) + Tosf Ai (/^) 

+ t,5, A;(/') = 0 (7-81) 

/((rj + 0 ) - H(r; - 0 ) - 754 , ATi (tj) 

~^l*t A2 (^j) + T554 X'i(tf) 

+ rT,8 Al(rp = 0 (7-82) 

W(7* 0) - (r* - 0) = 0 

and 

HU* + 0) - W(f+ - 0) = 0 

for all a. r, 6 , and ij; at each of the points S = 
and t~, either 

«iW = «i(5) = 0 (7-83) 

or 

«,(S) = «i(5) = 0; (7-84) 


the boundary conditions;, = 0, s = 1, 2, must 
be satisfied along with the conditions 

(7-85) 

and 

and the Weierstrass condition 

mxf. U. \ t) < H{Xf, M. /) 

must be satisfied for each t in (0,7), where U 
is any function which along with Xi and X 2 
satisfies Eqs. 7-52, 7-53, 7-54, 7-55, 7-60, 
7-61, and 7-62 with u replaced by U. The 
statement of Theorem 7-1 is now complete. 

If there is only a scalar control variable 
m( 0, then the condition of Eq. 6-105 will be 
violated at points which are intersections 
of intervals in which Eq. 7-52, 7-53, or 7-54 is 
an equality. With an additional hypothesis, 
however, the conclusions of Theorem 7-1 are 
still valid. 

At a point t* ^ co^, it is assumed that \pi = 
^2 ~ 0, where ^1 > 0 and \p 2 > 0 ait any 
two of the constraints of Eqs. 7-52, 7-53, and 
7-54. If ^ is defined as = min (ipi ,^ 2 )* then 
^ > 0 replaces the conditions ipi > 0 and ^2 
> 0. It is assumed that dipi/bu and b\p 2 l^u 
are not zero at t*. The new constraint now 
satisfies the conditions of Eq. 6-105. 

If Theorem 6-8 is applied to the new 
formulation of the problem, the result is 
identical to Theorem 7-1 with the exception 
of Eqs. 7-73, 7-74, and 7-76. However, the 
new conditions on u are identical to those 
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implied by Eqs. 7-73, 7-74, and 7-76. The 
roles played by Mi and in Theorem 7-1 
would simply be combined in a new variable 
M. This result may be stated as Corollary 7 - 1 . 

Corollary 7-1: Let there be a scalar design 
variable u{t) and assume that any two of the 
inequality constraints, Eqs. 7-52, 7-53, and 
7-54 are equalities at t*. If the first partial 
derivatives of these two constraint functions 
with respect to u are not zero at then 
Theorem 7-1 holds. 

One further result may be easily obtained. 
If dipilbu and b\l/ 2 l^u are nonzero at /^and 
are of the same sign, then u is continuous at 
t*. 

To prove this, it is supposed first that 
b}pilbu> 0 and w(r* + 0 ) = - 0 ) - €, e 

> 0. Taylor’s theorem (Ref. 16, p. 56) implies 

*iU^w(/^ + 0)l 0)) 

^ bu 

where 0 < 6 < 1. But, b\l/ilbu > 0 and e > 0, 
so 


0=^1 w + 0)] < \l/i - 0)) = 0 

which contradicts the assumption e > 0. An 
identical argument holds in the remaining 
cases, som[/* - 0 )] = 0 . 

7-4.3 STATICALLY INDETERMINATE 
PROBLEMS 

A statically indeterminate beam may be 
classified as one of two types: 

1. The beam is supported in such a way 
that all reactions are determined to within a 


finite number of unknown constants, or 

2. The beam is supported in such a way 
that an infinite number of constants are 
required to specify the reactions (e.g., a beam 
on an elastic foundation). 

In the first case, the unknown constants 
appear in the expressions for M and V. By 
defming new state variables, with i > 3, to 
be these parameters, the following differential 
equations must be satisfied: 


dt 


= 0 ,1 > 3. 


In this way, statically indeterminate prob¬ 
lems of the first type are reduced to varia¬ 
tional problems to which Theorem 6-8 ap¬ 
plies. 

For statically indeterminate problems of 
the second type, however, more basic changes 
in formulation must be made. The fourth- 
order differential equation, Eq. 7-50 must be 
treated since q(t) may be qlt»Xi(t),X 2 (t)]. 
The fourth-order equation is equivalent to the 
first-order system 


dxi 

dt 


= X2 


dx 


2 . 


^3 


dt EI{u.) 


dX3 

dt 


= X4 


and 


dX4 

nr 


= -q(t*x^,X2) 


where Xi = x, X 3 = Af, and X 4 = V. Theorem 
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6 - 8 now applies to statically indeterminate 
problems of the second type. 

7- 4.4 SOLUTION OF THE EQUATIONS OF 

THEOREM 7-1 

The Lagrange multipliers, Xq, X}, X 2 , are 
not uniquely determined by Theorem 7-1. 
However, if the solution, x{t) and u(t), is 
normal (Ref. 17, p. 214) for the problem, it is 
permissible to put X© = 1. Theorem 7-1 then 
determines the remaining X^ uniquely. Abnor¬ 
mal solutions are peculiar in that there may 
be no other functions, x(t) and u(t), near 
them which satisfy the conditions, Eqs. 7-50 
through 7-55. The procedure adopted in this 
subparagraph is to assume there is a normal 
solution and then attempt to solve for it. If 
this fails, there is either no solution or the 
solution is abnormal, in which case a special 
analysis is required. In the following, X© will 
be taken as 1 . 

In order to determine u(r), consider any 
interval in which z 0 < z < 4, of the 
inequalities, Eqs. 7-52 through 7-55, are 
equalities and the remaining 4 - z are strict 
inequalities. Eqs. 7-74 and 7-75 show that the 
4 - z multipliers corresponding to the 4 - z 
strict inequalities are zero. Then, Eq. 7-73 is a 
system of m equations in the m functions 
u^(t) and the z nonzero multipliers. Further, 
the z equalities of Eqs. 7-52 through 7-55 
yield z equations in the Thus, there are 
m + z equations which are to determine the m 
+ z unknowns. The nonzero n.{t) are first 
eliminated and the Uj(t) are then found as 
functions of t and the parameters fi and { 2 - 

At points f* one part of Eq. 7-55 is an 
equality, say the /th part (/ = 1 or 2). In this 
case, only and ^i,/+ 3 ,r can possibly 

be nonzero. Eqs. 7-77, 7-78, and 7-81 are 
three equations from which and 
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^ 1/+3 r eliminated and the /' deter¬ 

mined as functions of the parameters (1 and 
{ 2 - Note that fi and {2 to left of /'need 
not equal (1 and (2 fo the right. In exactly 
the same way, the /J are determined by Eqs. 
7-79, 7-80, and 7-82. (Continuity of H at /* 
and t* determines the /t and t* as functions 

T) 6 17 

of (1 and ( 2 * 

The equations previously enumerated 
determine /♦, /”, /", and t* as functions of 
/, ( 1 , and ^ 2 - problem would be solved 
by direct integration of Eqs. 7-71 and 7-72 
and application of gi - = 0 if and {2 

were known. 

The conditions which determine fi and {2 
are Eqs. 7-83, 7-84, 7-85, and 7-86. Eqs. 7-85 
and 7-86 after elimination of 7 i and 72 , yield 
two equations in fi and (2 • ft there are w of 
the points /’and /“, they subdivide ( 0 ,r) into 
w + 1 subintervals. Since these are the only 
possible points of discontinuity of Xi and X 2 , 
there are just w + 1 pairs, and { 2 * which 
represent 2w + 2 unknown. Thus, there are 
2 >v + 2 equations from which the 2 w + 2 
values of and (2 *^ay be determined. If this 
is not the case, the problem has no solution or 
the solution is abnormal. 

It is assumed now that u(t) and points t' 
and /“ are known functions of and ( 2 - A 
numerical method is developed which can be 
used to solve the equations given above for (1 
and (2 • A numerical solution is required since, 
even for very simple problems, the function 
/ 2 (Lti *^ 2 ) is far too complicated to integrate 
in closed form. 

Expressions, Eqs. 7-85 and 7-86 generally 
yield two easy relations between fi and ( 2 - 
Eqs. 7-83 and 7-84, however, require succes¬ 
sive integration of/ 2 (/,{i ,€ 2 )- To complicate 
matters, some limits of integration are the 
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points t“ and rj, which arc themselves func¬ 
tions of and { 2 - Therefore, Eqs. 7-83 and* 
7-84 form a system of nonlinear, finite 
(nondifferential) equations in (1 and ( 2 * 


A Generalized Newton Method (Ref. 18, p. 
220) is used to solve this set of equations. A 
generalization of Leibniz’ Rule is used to 
compute the required derivatives of integrals 
with variable limits of integration. This rule is 
(Ref. 16, p. 80). 


dT 


Cg2 (t) 
]gi(r) 


f2it^T)dt 

gi(r) 

it (r) 


dt 


+/i \g2 (r). t] 


dg 2 (r) 

dr 


> (7-87) 


-fi l«i(r),rl 


dgi (t) 
dr 


where 3 / 2 (t,r)/dr is piecewise continuous, and 
gi and g 2 differentiable. It is assumed, 
also, that /2 and 8 / 2 /dr are continuous at r = 
gi(T) andgiir). 


7-4.5 BEAMS WITH RECTANGULAR 
CROSS SECTION OF VARIABLE 
DEPTH 

Three examples are considered in this 
subparagraph. In each example, the beam 
cross section is rectangular with fixed width 
and variable depth. Also, the constraint, Eq. 
7-55, is taken as 

-A<x(t)<A (7-88) 

where A > 0 is a constant. 


First, the equations of Theorem 7-1 are 
written out in detail and simplified for the 
case of beam with rectangular cross section of 
variable depth. In pars. 7-4.5.1, 7-4.5.2, and 
7-4.5.3, three specific examples are con¬ 
sidered. These examples range from an easy 
problem in par. 7-4.5.1 to a rather complex 
one in par. 7-4.5.3. 

The cross section considered here is shown 
in Fig. 7-9. For this particular cross section, if 



h 

t) 




Figure 7-9. Rectangular Cross Sectior} 


’’m.x > 1/2 Eqs. 7-52 and 7-53 are 

satisfied if and only if Eqs. 7-56 and 7-57 are 
satisfied. This result may be proved by ex¬ 
pressing and as functions of d and 
applying methods of ordinary calculus. The 
restriction > 1/(2 is necessary, 

since at the extreme fiber of the beam, the 
principal shear stress is half the principal 
normal stress. This relation between and 
^mix ^ restriction for design of metallic 
beams. Yield stresses for steel and other 
common metals satisfy this condition. 

In this case, there is only one design 
variable h(t); Eqs. 7-44 through 7-48 are 



(7-89) 

A{t) = bh{t) 

(7-90) 


7-25 


Digitized by L^ooQle 



AMCP 706-192 


II 

(7-91) 

In any 
equality. 

interval where Eq. 

7-54 is an 

QO) 

(7-92) 

bh^it) 

12 

/o 


and 


so 



C(t) 

(7-93) 

h{t)= ( 

' 12/.\ 

(7-95) 


Af(r) and V{t) are assumed to be known, 

piecewise twice continuously differentiable In any interval, (/". rp, where Eq. 7-88 is 

^unctions of t whose discontinuities occur at an equality, direct differentiation and use of 
points ^ . Eqs. 7-71 and 7-72 yield 


Eqs. 7-89 through 7-93 and Eq. 7-70 along 
with Xo = 1, yield 


H- - bh’^\xX2 



The procedure outlined is now used to 
determine h{t). In any interval where Eq. 7-56 
is an equality. 


12 Af(/) 


(7-96) 


In order for Eq. 7-96 to be satisfied, it is 
necessary that M{t) = 0 (hence also V{t) = 0 = 
q(t)) must be identically satisfied in (/", r^). 
If this is the case, h(t) is given by Eq. 7-95. 


6|Af(0l 

bhHt) 


so 


In any interval where Eqs. 7-56, 7-57, 7-54, 
and 7-88 are all strict inequalities, Eqs. 7-84 
and 7-75 show that M/(f) = 0,i = 1,..., 5. Eq. 
7-73 then is 


ba 

L max 




■^>■•( 0-0 


(7-97) 


In any interval where Eq. 7-57 is an 
equality. 


3|F(0| 

2bh(t) 


= r 


max 


SO 


hit) = 


3|K(0I 
Ibr _ 


7-26 


so 


It is worthwhile to note that in order for 
Eq. 7-97 to hold the product X2(/)A/(0 must 
be positive. That is, X 2 (/) andAf(f) must have 
the same algebraic sign throughout any inter¬ 
val in which Eq. 7-97 holds. 


Digitized by L^ooQle 



AMCP 706-192 


In a more compact notation, 


3|K(t)| 


/i (0 = 


2b T. 


(6mt)\\ 
\ / 


1/2 


max 


m ■ 

r36X^(r)Af(/)] 

L Eb^ J 


1/4 


if|T|=T. 


(7-98) 
if/ = /o 


max * 


if It| < T, 

l®l< ®m.x> 

and/> /q. 


The Weierstrass condition shows that the 
largest of the expressions in Eq. 7-98 is the 
proper value of h(t)- 

Eq. 7-98 in Eq. 7-64 yields 


/( 0 = < 


-c,/^^(olK(/)^^ 


-c,i/»/(or'/* 1 

1 


X sgn [3/(0], J 

MU) 

EIo ’ 

II 

(7-99) 



X \M(t)\*l^ 


xsgnlM(t)], J 

' and/> /o. 



and the function sgn ( ) is defined by the 

relation 

q sgn(q) = q 
for real^r. 

Equations which determine the special 
points f;, t*, r“, and may now be found. In 
the problem at hand, Xi (t) and X 2 (t) are 
constant, so their derivatives are zero. Eq. 
7-81 is then 

^(/;-0) = ^(r; + 0). (7-100) 

Experience has shown that on both sides of 
Eqs. 7-56, 7-57, and 7-54 are strict 
inequalities. Assuming this is this is the case, 
Eqs. 7-94 and 7-98 together with Eq. 7-83 or 
7-84 may be used to simplify Eq. 7-100. The 
result is 

= [Xj(r; + 0 )/W(/; + 0 )]>/‘' . (7-101) 

Eq. 7-98 and I > 0 imply Af(/;) ^ 0, so if 
MU) is continuous at fp, then Eq. 7-101 
reduces to 

X2(r;-0) = X 2 (f; + 0). (7-102) 


where 


Points of discontinuity of MU) niust be 
checked in Eq. 7-101 as possible 


Cl 


32b^T^ 


9E 


Eq. 7-96 shows that points and t~ can 
occur only in intervals where Af(/) (hence also 
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y(0 and q(r)) is identically zero. Since this 
situation is not common in practical prob> 
lems, such intervals will not be discussed here. 


According to Theorem 7-1, the points 
are determined by the condition 

- 0) = + 0) . (7-103) 


~ c^\j(Q)M(Q) I y(Q) r’ 

1/4 


By direct computation it is seen that the 
partial derivatives with respect to h of the left 
sides of 


Using the definition of C 3 in this equation 
and manipulating the result yields 

X,(G)A/(G) \ 9E / 


\a\-a„ 


< 0 


and 


\V(Q)\^' 

X 2 (G)Af(G). 


3/4 


+ -inaJL = 0. 

9E 


lo - I< 0 

are all negative at points where M(t) ¥= 0 ¥= 
V(t), The result stated just below Corollary 
7-1 then shows that points of intersection of 
intervals in which one or more of the above 
inequalities is an equality may be determined 
by the condition 

Mf* - 0) = /i(f* +0) . (7-104) 


(7-105) 

By putting 

iK(G)r 

Xa(G)3#(G)J 


and 



Eq. 7-105 becomes 


If Eq. 7-104 is used to determine /*, then 
points must be checked in Eq. 7-103 as 
possible . 

Let G = f* ^ be defined to be a point 
of intersection of two intervals such that \t\ = 
^max of r = C and Eqs. 7-56, 

7-57, 7-54, and 7-88 are strict inequalities on 
the other. Point Q is to be determined by Eq. 
7-103. Due to continuity at / = G, Eq. 7-103 
may be written as 
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3P^ - 4CP^ + = 0 . 

The roots of this equation in P are C C, 0(1 + 
y/2i), and C(1 - V^O, where = - 1. The 
fourth powers of the last two roots are not 
real, so the only real solution of Eq. 7-103 is 

/ 64b^T^ \ 

|K(Q)r=(^— ^ ""j x3(G)Af(G). 

(7-106) 
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Similarly, S - t* cj is defined to be a 

t| a 

point of intersection of two intervals such 
that |a| = ^ 

7-56, 7-57, 7-54, and 7-88 arc strict inequal¬ 
ities on the other. Just as above, Eq. 7-103 
reduces to 


3P-4dP+C^=0 


where A < B < C, and 




dgiiot) 




dot 




(7-109) 


Therefore, the only real solution of Eq. 
7-103 at/ = 5is 



In deriving Eqs. 7-106 and 7-107, it was 
assumed that Q and S were not equal to any 
The co^ are, therefore, possible choices 
for Q and S and must be checked in Eq. 
7-103. 


Leibniz* Rule, Eq. 7-87, is used repeatedly to 
obtain Eq. 7-109. 


7-4.5.1 A PROBLEM WHICH CAN BE 
SOLVED ANALYTICALLY 

As a first example, the cantilever beam of 
Fig. 7-10 is considered. This problem is simple 
enough that a solution can be obtained 
analytically. 


In particular problems, the following two 
identities are used: 


Boundary conditions for this beam are 



+ 


+ 


t 
tt 
tt 


f(v)dv^(C-B) 


f{‘q)dndv 


f(Ti)dridv, 


£ 


mdv 


(7-108) 


Xi(0)=x,(0) = 0. (7-110) 



Figure 7-10. Simple Cantilever Beam 
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The bending moment and shear are 


and 


M{t) - M> 0 [M constant) 
and 

Ka) = 0. 

For simplicity, let 7© > 0 be small enough 
so that 


— 

dt 


- =/ 2(0 


and application of Eq. 7-110 yields 
x,(0 = j' (7-113) 



If this is the case then lit) > lo for aU t. 
Further, since r(t) = 0, Eq. 7-98 may be 
simplified to 



if I a I = a 


max 


if |al< 


o 

max 


(7-111) 


Also, Eq. 7-99 becomes 



and 


X, (t) = I* I” hin)dndv . (7-114) 

The inequality fiit) < 0 and Eq. 7-113 
imply ^2 < 0 in (O.T), so there can be no 
points t*. The only possible point at which 
Eq. 7-88 is an equality is t = T. Since/ 2 (f) < 
0, Eq. 7-114 implies Xi(0 < 0. Therefore, if 
Eq. 7-88 is an equality, it must be Xi(r) = 
- A. Further, since there are no /J, fj, or 
Ca;^, X| and X 2 are continuous and there is 
only one pair of constants, (1 and ( 2 * to be 
determined. 

It is assumed first that Xi(7)> - A. In this 
case, Eq. 7-86 implies 

x,(r> = ti=o 

and 




if I a I < 


max* 


(7-112) 

Integration of the differential equations 


which in turn implies ^ 1 =^ 2 = 0. Since X 2 = 
0 throughout (0,7), the second part of Eq. 
7-111 cannot occur. Therefore, the beam of 
minimum weight is uniform. Eqs. 7-112 and 
7-114 then yield 




2 


7-30 


Digitized by L^ooQle 



AMCP 706-192 


Therefore, Eq. 7-111 with \a\ = is the 
solution of the problem provided the deflec¬ 
tion requirement A is such that 



If the deflection requirement A does not 
satisfy Eq. 7-115, then it is necessary that 
Xi(T) - - A. Otherwise, this argument would 
hold, and the deflection at T would violate 
Eq. 7-88. Therefore, the additional boundary 
condition. 






g3 =ACi(r) + A = 0, 

must be satisfled. The two constants (i and 
(2 must now be found. 

The only useful relation given by Eq. 7-86 
is 


The right-hand side of this equation may be 
simplified by eliminating either fi or 
through use of Eq. 7-116. Since fi does not 
have as much physical significance as t*, it is 
eliminated. The conditions Xi(T) = - A 
becomes 


-{iT = 0. 

This implies fa = ^i so that 


X2(r) = ti(T-r), 



and only (1 remains to be found. 

On physical grounds, it is expected that the 
beam should be stiffest near f = 0 in order to 
reduce the deflection at T efficiently. Also, 
since X 3 is largest at f = 0 , the second part of 
Eq. 7-111 would tend to stiffen the beam 
there. It is assumed, therefore, that there is 
just one point having I I < 
left and I I = 7-107 for 

t* is 

W = (7-116) 

Eq. 7-114 with / = T may be integrated 
using Eqs. 7-108 and 7-1 1 2 and becomes 


(7-117) 

The derivative of the right side of Eq. 
7-117 with respect to is zero at = T and 
positive everywhere else. This means that Eq. 
7-117 has at most one solution. Eq. 7-116 
then determines (1 and the problem is solved. 

As a numerical example, the beam of Fig. 
7-10, having the following properties, is con¬ 
sidered: 

r= 10 in. 

^ = 1 in. 

®m.x = 30,000 lb/in.» 
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£=10*^ lb/in.2 
and 

M = 450 in.-lb 

If A > 1 in., then the beam of minimum 
weight is uniform with h - 0.30 in. 

For a more meaningful problem, A = 0.5 
in. is considered. Eqs. 7-117 and 7-116 yield 

= 7.7 in. and fi = 2.17. The precise shape 
of the optimal beam is given by Eq. 7-111. By 
putting Eqs. 7-111 and 7-90 in Eq. 7-49 arid 
performing the indicated integration, the vol¬ 
ume of the optimal beam is found to be 3.59 
in? A plot of the profile of the optimal 
beam may be made by direct substitution into 
Eq. 7-111. This profile is shown in Fig. 7-11. 


0.431 in. H 


E 


0.30 in. 


7.7 in. 


\M 
10 in. 


Figure 7- 11, Cantilever Beam of Minimum Weight 


By elementary computation, it is seen that 
the uniform beam which has JCi(lO) = - 0.5 
in. and satisfies Eqs. 7-56 and 7-57 is 0.378 
in. deep; so its volume is 3.78 in? The 
designed beam, therefore, has 5.3% less vol¬ 
ume than a uniform beam which will satisfy 
the same stress and deflection requirements. 

7-4.5.2 SIMPLY SUPPORTED BEAM 
WITH POSITIVE DISTRIBUTED 
LOAD 

The beam considered here is simply sup¬ 
ported (see Fig. 7-12) with piecewise con¬ 
tinuously differentiable distributed load q(t\ 
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q{t) > 0 for all / in (0,7). The load q{t) of this 
form implies V{t) is non-negative at zero and 
decreases monotonically in (0,7). M(t) is zero 
at both t = 0 and t = 7. Further, on either side 
of the point where M(t) has its (non-negative) 
maximum it is monotone. 


ffmTri^ rnrmTn 

TTtTT 

Figure 7- 1Z Simply Supported Beam With 
Positive Distributed Load 


If 1 q{t)dt # 0, then M(t) and V{t) cannot 
be zeio^at the same point. There is, therefore, 
no danger that the optimal beam will have 
h(t) * 0 at any point. For this reason, the 
constraint, Eq. 7-54, is not imposed here. 

Since 3f(0) * M(T) = 0 and K(0) # 0 # 
F(0, It| = is expected near the ends of 
the beam. Toward the center of the beam, 
M{t) becomes large, so |a| is expected 

there. If the beam requires stiffening, the 
additional material can best be used near the 
point of maximum deflection. This argument 
indicates that (0,7) should be broken up into 
subintervals as shown in Fig. 7-13. 

In terms of previous notation, ti, U* 
and ts correspond to the notation t^; and 
to /*. For certain ranges of A some of the 
subintervals shown in Fig. 7-13 will not 
appear. 

Provided ti and ts separate intervals in 
which |r| = and |a| = ^ 

determined by Eq. 7-104. The points tj and 
^ 4 , when they exist, are determined by Eq. 
7-107. Finally, t^ is determined by Eq. 7-102. 
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Figure 7- 13. Subdivision of the Beam With Distributed Load 


The boundary conditions for this problem 
are 

x ,( 0)»0 

and 

Eqs. 7-75 and 7-76, therefore, yield 
Xa(0) = Xa(r) = 0. 

The constants fi and fa in Xa may have 
different values on opposite sides of ta. To 
the left of ta, 

Xa(0) = ta - x0 = 0 

so 

Xa(/)=-fit = rit (7-118) 

To the right of ta» 

Xa(r) = t2 

so 

x,(0 = (l r*(l -y). (7-119) 

The constants fi and fa» which are intro¬ 
duced in Eqs. 7-118 and 7-119, are now to be 
determined. 


Eq. 7-102 for yields 

/ - 

* ni+w 

Assuming ta ^nd exist, the equations that 
determine them are 

^(ta)“^ ~E ) (7-120) 

and 

««.)-(■%■)?, (l-f) 0-121) 

In this case, ti and ts are determined by Eq. 
7-105. If /a or does not exist, then ti or ts 
is determined by 

( 64h^T^ \ 

I ) r = (■ f » hmx ) 

(7-122) 

or 

xfi(l-^)««.). (7-123) 

It is noted that Eqs. 7-120 through 7-123 
can be solved easily for and but, in 
general, not so easily for the t^. In the 
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development which follows, it will be con¬ 
venient to use Eqs. 7-120 through 7-123 to 
solve for f i and fa functions of the 


/^a=A + (r-/4)l f2iV.i2)dv 


f: 


The conditions that are to determine the 
unknown are x 1 (^ 3 ) = A and = 0. 

However, for computational reasons, a more 
convenient set of equivalent conditions is 


xi(0) = 0 (7-124) 

and 

JCi(T) = 0 (7-125) 

where Xiit^) - A and X 2 (h) = 0 are used as 
initial conditions for integration. 

Conditions, Eqs. 7-124 and 7-125 may be 
written explicitly as 


pi p. 

/?,= A + riI /i(»?)c/Tf + /2 I /j(Tf)cfjJ 

Jo J t, 


+ 13 1 


•1: 

[■ i; 

n: 


f2(v)dvdv 


f2(V)dvdv 


1:1: 


/ 2 (r?,fi)flfT?flfr = 0 (7-126) 


and 

7-34 


+ (r-/s)[ f2(v)dv 

J ^4 

I f2(V,t.)dvdv 

J r, Jr, 

+ 1 { f2(vi)dndv 

+ [ [ f2(fi)dvd^ = 0, (7-127) 

Jr, Jr, 

where Ri and R 2 are introduced for nota- 
tional purposes. 

It is assumed now that q(tX b, E, T, 
and are given. The equations that 

determine the are different in four distinct 
ranges of the deflection requirement.' These 
ranges of A are described in the following: 

1 . Aq denotes the largest deflection that 
occurs when the subinterval (fa, ^ 4 ) does not 
appear, i.e., when the beam is specified by 
only the first two parts of Eq. 7-98. 

If A > Ao, then the beam specified by the 
first two parts of Eq. 7-98 is the one of 
minimum weight. 

2. For A slightly less than Aq, there exist 
points t 2 and ^4 that are determined by Eqs. 
7-126 and 7-127. 

As A decreases, points t 2 and ^4 move 
toward zero and T, respectively. There is a 


Digitized by L^ooQle 



AMCP 706-192 


value of A, say A = Ai, for which either or 
(4 first coincides with ti or /s, respectively. 
For definiteness, assume (2 “ h when A = 
Ai. 

3. For A slightly less than Ai, points ti 
and (4 are determined by Eqs. 7-126 and 
7-127. 


method of solution will be described for the 
Case 2. 

An iterative method called Generalized 
Newton Method (Ref. 18) is used to solve for 
t 2 and t 4 . The procedure begins by estimating 
values and f 4 ; and then making a correc¬ 
tion according to the formula 


As A decreases, points ti and ^4 move 
toward 0 and T, respectively. There is a value 
of A, say A = A 2 , for which ^4 first coincides 
with . 

4. For A < A 2 , points ti and ts are 
determined by Eqs. 7-126 and 7-127. 




~ ” 


ra/?, 

BRC 

-1 




h 


a.. 

bu 


Ri 






bRi 



u 


^4 


jdt2 

9^. 


R 2 


(7-128) 


This explanation of the behavior of the is 
not the result of a mathematical analysis. It is 
expected on physical grounds and has been 
valid in each case treated. 

The values of Ai and A 2 could be obtained 
analytically. However, their determination 
would be of the same order of difficilty as 
the optimization problem considered in this 
paragraph. 

A| and A 2 can be determined by a trial 
and error scheme. For example, to determine 
Ai. t 2 is put equal to ti and Eq. 7-107 
determines . Then, is guessed and Eq. 
7-102 solved for ^ 2 - Numerical integration of 
Eqs. 7-60 and 7-61 indicates the correction 
that is to be made in ^ 3 . When ^ located 
accurately, the resulting deflecting at ^3 is A|. 
A similar procedure is used to determine A 2 
(^4 is put equal to f 5 ). 

The solution of Eqs. 7-126 and 7-127 for 
the differs only in certain details depending 
on whether the given value of A is in the 
range described by Cases 2 , 3, or 4. The 


where [ denotes matrix inverse, and ^2 
and (4 are improvements on the estimate. 

Eqs. 7-102 and 7-107 determine = 
hiti ft2)f fi " fi(f2)» and ^2 “ t2(U)- By nse 
of this information, the derivatives in Eq. 
7-128 are computed by the chain rule of 
differentiation. For example. 



ZdR.^dR, dt3\ at/ft 

\ 1 dt, df, J dt2 


(7-129) 


where 


^i,r, =^ 2 [/ 2 (^ 2 - 0 )-/ 2 (t 2 + 0 )] 

is the partial derivative of R 1 with respect to 
t 2 with all variables in R 1 taken as indepen¬ 
dent. 




d/ 2 (T?.fi) 

dfi 


drt 
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-!" J’ 

J f, J f, 

and 

■^ = ^3/l(^3 -0). 

0/3 

Similar expressions for the remaining deriva¬ 
tives in Eq. 7-128 are derived with the aid of 
Eq. 7-109. 

The iterative procedure for determining 
and t 4 is: 

Step 1. Make an estimate, ?2 ^nd ^ 4 , 

Step 2. Solve Eqs. 7-120 and 7-121 forfi 
and fa, 

Step 3. Compute, numerically, all the 
integrals in Eqs. 7-127, 7-128, and 
the remaining derivatives corre¬ 
sponding to Eq. 7-129, 

Step 4. Compute the right side of Eq. 
7-128, and 

Step 5. With this improved estimate return 
to Step 1. 

This procedure has been programmed for a 
digital computer. The program was arranged 
in such a way that only Aq, Ai, A 2 . M(t), 
V(t)y and the physical properties of the beam 
need to be specified. Many different loading 
situations may thus be considered withoAit 
altering the program appreciably. 

As a numerical example, a beam with the 
foUowing properties is considered: 

q{t) = /Ib/in. 
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T = 40in. 

b = 0.25 in. 

E =10*' lb/in.2 

^mix = 15,000 Ib/in.^ 
and 

^mtx = 30,000 Ib/in.^ 

For this problem, it was found that 
Ao = 0.774 in. 

Ai = 0.728 in. (^2 “ h) 

and 

A 2 * 0.470 in. (^4 *fs). 

As has been noted, the magnitude of the 
deflection requirement A plays a major role in 
the outcome of a particular problem. In order 
to emphasize the effect of A on the properties 
of the optimal beam, the numerical example 
given was solved for eight different values of 
A. The results are presented in Table 7-4. 

In Table 7-4, the first column contains the 
values of A considered. The following seven 
columns give information which, when sub¬ 
stituted into Eq. 7-98, completely specifies 
the optimal beam. The next column gives the 
volume of this optimal beam. The final two 
columns give the volume of the lightest beam 
of constant depth which satisfies the condi¬ 
tions of the problem and the percent saving 
realized when the optimal beam is used 
instead of this uniform beam. Dashes have 
been inserted in the table when the quantity 
to be tabulated does not exist. 
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For each value of A, the iterative procedure 
used to solve the problem required approxi- 
mately 40 sec per iteration on an IBM 1410 
Computer. Further, three to six iterations 
were sufficient to obtain convergence of the 
residue to seven decimal places, so computing 
time was not excessive. 

It is shown (Ref. 18, p. 222) that if the 
sequence of approximations constructed by 
the Generalized Newton Algorithm converges, 
then it must converge quadratically, i.e., the 
error at the n + 1st step is proportional to the 
square of the error at the nth step. This rapid 
convergence was observed in the numerical 
calculations and explains why only three to 
six iterations were required. 


and 

K(O = 0. 

In this problem A > r/2 is assumed. 

Since M{t) is never zero, the requirement 
|a| < implies /i # 0 for all t. / = 0 is 

impossible, so the requirement I > Iq is not 
enforced. 


Eqs. 7-98 and 7-99 in this case are 



7-4.5.3 A PROBLEM OF A MORE 
GENERAL TYPE 

The beam considered here is loaded as 
shown in Fig. 7-14. 


and 


/ 2 ( 0 =< 




( -CiM 

xsgn[3f(/)l, 




X sgn [3f(/)], 


if |ff| < o„ 



2M , A 



For a given deflection requirement A there 
are three possibilities concerning attainment 
of the maxium deflection. Either 


Figure 7- 14. Beam With an Inflection Point 


1 . \xit)\< A for all/. 


2. \xit)\ = A for just one /, or 


Boundary conditions are, as in the simply 
supported case. 


JCi (0) = X| (r) = 0. 


For the given loading. 




M, if 0 < t < A 
-M, ifA< t<T 


3. |x(/)| = A for two distinct t. 

The third possibility occurs here because 
changes sign. In Case 1, “ ^mix 

out the beam determines h(t). TTie Case 2 
may be treated in exactly the same way as the 
problem in the preceding paragraph. Case 3 is 
considered in detail here. 

Assume there are two points at which 
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i 


■ 





(7-134) 


, _r3 + f4 


(7-135) 


and 


Figure 7-15. Subdivision of the Beam 


r.=rri-(.4")±]. 
L \<.x/ f 4 j 


(7-136) 


Points t 2 and f 7 , where Case 3 occurs, are 
so they are determined by Eq. 7-102. 
Further, XiC/j) = A and Xi(r 7 ) = - A. Points 
and fg are t* and are determined by 
Eq. 7-107. 


In this problem, /g and may be points of 
discontinuity of Xj and Xg. The conditions, 
Eqs. 7-85 and 7-86 on X 2 are X 2 ( 0 ) = X 2 (r) = 
0 . Therefore, X 2 may be written as 


/ 


X 2 ( 0 =< 


fa ~ fa f 


, if 0 < t < t2 

y if t2 < t < tf 


\ 



, if f 7 < / < /. 


Solving Eqs. 7-102 and 7-107 for the 
yields the foUowing: 


In obtaining the expressions for and tg, 
use was made of the fact that < A < 

The result, X 2 (r)A/(r) > 0, from Eqs. 7-97 
and 7-98 shows that this is true. To prove 
this, assume for definiteness > A. Since 
AfftJ changes sign at A and X 2 (tj is con¬ 
tinuous there, X 2 (A) 0. But, since XgCO is 

continuous near A, it is arbitrarily near zero 
in a neighborhood of .4. Eq. 7-130 then shows 
that h(t) is arbitrarily near zero in a neighbor¬ 
hood of i4 and this violates the condition |a| 
< LiJ^ewise,>l < 


Conditions that determine the are 


'A,X2(f2)“0 


and 


^1(^7) “A, X 2 (f 7 ) — 0 . 



(7-131) 

For computational reasons, it is more con¬ 
venient to use the following equivalent set of 
conditions; 

(7-132) 

JCi(0) = 0 


•^ 2 (^ 7 ) “ 0 

(7-133) 

Xi(r7) + A = 0 
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XiiT) = 0 

where JCjC/a) = A and XjC/a) = 0 are used as 
initial conditions for integration. 

More explicity, these equations are 


Ri =x,(0) = A + /i I f2(r))dr) 




fi' 

•'o 


ff 


fiMdridv 


f2(fl,Si)dvdv=^0 


(7-137) 


R 2 “<^ 2 (^ 7 ) “ 


= ( / 2 (r?,? 2 .{ 


+ \ f 2 (v)dri 


f2iri.i2>i3)dv^0 (7-138) 


R 3 =^ 1 (^ 7 ) + A = 2A 


+ (^-^3)1 


/ 2 ( i 7>?2 fi3)dri 


+ ^7 -/6)( 

Jr, Jr, 

^ J 

*• 


f 2 (v)dv 




f 2 (v)dr)dv 




(7-139) 


/?4 =x(D = -A 


+ (7’-/,) f 

('/.<„ 

J r, J r, 

+ ( [ /2(t?5 


f 2 (V.U)dr} 


f2{rt,iA)dndv 


f2(V)dridv = 0 (7-140) 


where the are introduced for notational 
purposes. 

Generalized Newton Method is used to 
solve Eqs. 7-137 through 7-140 for the f^. An 
initial estimate / = 1 , 2 , 3 , 4 , is made; and a 
correction is computed according to the 
formula 
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’rf 


li 


~r7 

fa 


fj 


R 2 

Fa 


fa 


R3 

_F4_ 


_f4_ 




(7-141) 

where terms on the right side of Eq. 7-141 are 
computed in terms of the The corrected 
guess then takes the place of and the 
process is repeated. 

The derivatives of the with respect to 
the are computed by the chain rule of 
differentiation, Eq. 7-109, and Eqs. 7-137 
uough 7-140. Just as in Eq. 7-129 several of 
these derivatives must be determined by 
successive numerical integration. 

The matrix of derivatives which appears in 
Eq. 7-141 has sixteen elements. Twelve suc¬ 
cessive definite integrals appear in one or 
more elements of this matrix. Therefore, 
considerable computation is involved in each 
iteration. All this computation was incor¬ 
porated in a single computer program. 

As a numerical example, the beam of Fig. 
7-14 having the foUowing properties is con¬ 
sidered: 

1,100 in.-lb 
r = 40 in. 
b = 0.5 in. 

£ = 10*' Ib/in? 
i4 = 25 in. 


= 10.000 iWin? 

and 

= 20,000 lb/in.2 

It was noted that three distinct situations 
may occur depending on the value of A. In 
this example, the problem breaks down as 
follows: 

1. If A > 0.509, then Xi (/) < A for all/, 

2. If 0.156 < A < 0.509, then there is just 
one point / for which |xi(01 = A, and 

3. If A < 0.156, then there are two values 
of t for which |xi (r) I = A. 

The numerical example given was solved 
for eleven different values of A. The results of 
these calculations are presented in Table 7-5. 
The first column of this table consists of the 
values of A considered. The following ten 
colunms give information that, when 
substituted into Eq. 7-130, completely speci¬ 
fies the optimal beam. The next colunrn gives 
the volume of this optimal beam. The final 
two columns give the volume of the lightest 
beam constant depth that satisfies the condi¬ 
tions of the problem and the percent saving 
realized when the optimal beam is used 
instead of this uniform beam. Dashes have 
been inserted in the table when the quantity 
to be tabulated does not exist. 

For each value of A, the iterative procedure 
used to solve the problem required approxi¬ 
mately two minutes per iteration on an IBM 
1410 Computer. However, three to five itera¬ 
tions were sufficient to obtain convergence of 
the residue to seven decimal places, so com¬ 
puting time was not excessive. This rapid 
convergence is, again, characteristic of the 
Generalized Newton Method. 

An interesting sidelight of this particular 
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example concludes the present subparagraph. 
A plot of volume (of optimal beam) versus 
deflection requirement for the problem con¬ 
sidered here is given in Fig. 7-16. From this 



Figure 7-16, Volume vs Deflection 
Requirement 

graph, it appears that the volume of the 
optimal beam is a continuous function of 
deflection requirement. This is a rather re¬ 
markable result in view of the fact that 


optimal beams with deflection requirements 
^eater and less than 0.156 have considerably 
different form. The beam profiles of Figs. 
7-17 and 7-18 illustrate this difference graph¬ 
ically. As A decreases toward 0.156, the jump 
in ^(0 at r = 25 (see Fig. 7-17) becomes more 
pronounced. However, for A very slightly less 
than 0.156, the profile is continuous, much as 
in Fig. 7-18. 

7-4.5.4 CONCLUSIONS 

The examples considered in pars. 7-4.5.2 
and 7-4.5.3 are of the order of complexity 
that might be found in actual practice. In 
these examples, a saving of material up to 
33% is realized when nonuniform, optimal 
beams are used instead of uniform beams. For 
more complex loading situations, the saving 
may be even greater. From an engineering 
viewpoint, such savings are significant. 


In structural applications, this saving may 
be offset by additional cost of fabrication. 


All dbnensioiis in inches 


1.14 



Figure 7- 17, Profile of Optima! Beam for A = 0,16 


All dimeniioiis in inches 



24.5 31.0 33.7 40 

Figure 7- 18. Profile of Optima! Beam for A = 0.15 
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However, for applications in which weight is a 
premium, such as in aerospace work, fabrica¬ 
tion of minimum weight structural members 
may be quite feasible. Further, if the cost of 


forming nonuniform beams is not prohibitive, 
such as in the manufacture of reinforced 
concrete beams, then nonuniform optimal 
beams may be used to advantage. 
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CHAPTER 8 

METHODS OF STEEPEST DESCENT FOR OPTIMAL DESIGN PROBLEMS 


8-1 INTRODUCTION 

As seen by the examples of Chapter 7, 
solution of the necessary conditions for the 
general problem of optimal design is difficult. 
Even in idealized design problems numerical 
methods must normally be employed to 
construct a solution. 

The numerical techniques for the indirect 
method presented in par. 6-5 and in Chapter 7 
are iterative in nature. Each of the techniques 
requires that an estimate of the solution be 
made before the iterative process may be 
initiated. In many cases, particularly in new 
problem areas, the designer may have only a 
gross notion of what to expect of the solution 
so his initial estimate may be poor. 

Convergence of the techniques of Chapters 
6 and 7 are reported to be very poor unless 
good estimates of the solution are available. 
In fact, these iterative techniques often di¬ 
verge for poor estimates of the solution. On 
the other hand, if a good initial estimate is 
available, these methods converge very rapid¬ 
ly. 

This discussion illustrates the need for a 
workhorse technique that may be used even 
when only poor estimates of the solution of 
the optimal design problem are available. The 
method should be capable of making steady 
improvement in an estimated solution and, in 
fact, converge to the solution. Rate of con¬ 
vergence could be sacrificed for dependability 
if required. 


A second desirable property of a general 
method of optimal design is that it apply 
routinely to a large class of real-world optimal 
design problems. To be useful to the working 
design engineer, the method should apply 
whenever the designer has developed the 
capability to analyze the system to be de¬ 
signed. Further, the method should be ex¬ 
plicit enough so that a senior engineer can set 
the problem up for computation and a less 
experienced junior engineer can program the 
algorithm for use on a digital computer. 

The methods to be developed in this 
chapter and applied in the next have many of 
these nice-to-have properties. The basic idea 
of these direct methods is to simplify the 
basic design problem so that it will readily 
yield information which allows the designer 
to make a small improvement in an estimated 
optimum design. After the improvement is 
made, a new and better estimate of the 
solution of the optimal design problem is 
obtained. The process is repeated successively 
to obtain small improvements in the best 
available estimate of the solution until the 
design obtained is sufficiently near the opti¬ 
mum. 

The basic method of simplification of the 
design problem is to expand functions in¬ 
volved in the problems through use of Tay¬ 
lor’s Formula. In this way, a simplified 
problem is obtained which serves as a good 
approximation of the original problem pro¬ 
vided only small changes are allowed in 
certain variables. 
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8-2 A STEEPEST DESCENT METHOD FOR 
THE BASIC OPTIMAL DESIGN PROB¬ 
LEM 

8-2.1 THE PROBLEM CONSIDERED 

In order to present the basic ideas of the 
Method of Steepest Descent, consideration 
here will be limited to optimal design prob¬ 
lems with fixed endpoints, no discontinuities 
in the basic problems, and no intermediate 
conditions on the state variable. As seen in 
par. 6-4, this eliminates state variable in¬ 
equality constraints from direct treatment. 
All these features of more general optimal 
design problems will be treated in par. 8-3. 

Speciflcally, the problem treated here is to 
find u(t), t , and b which minimize 


and 

= 0 ,^ = 1 , q\ < t < /•. 

► 

< 0, = * t < t*. 

(8-4) 

Just as in Giapter 6, the variables Jc(/X u(t)^ 
and b are vectors, x(/) = [xi(0, 
tiit) = [mi(0 , and 6 = 

Any inequality constraints of the form 

G}(t,x,u,b) < 0 (8-5) 

can be transformed into a constraint of the 
form 


,xV)+ f folt.. 


J^goib. 


subject to the conditions 
dx 


(8-1) 


dt 


=At.x.u,b), < rt' 


> ( 8 - 2 ) 


®,(JC®,Jt‘) =0 J= 

'I'a =iJ.b,X^,X*) 

. t‘ 

.x(t)M(t)J>]dt = 0 , 


( 


a= 1 ,..., r' 






x(t)M(.t)J>] dt< 0 , 
a = r' + 1, r 


f 


>(8-3) 





+ |cj[/.x(r)4/(0^J 1 } c// = 0 (8-6) 


which is then a constraint of the kind of Eq. 
8-3. 


The class of problems considered is, there¬ 
fore, fairly general The essential features that 
are not included are variable limits of inte¬ 
gration, discontinuities in functions of the 
problem (u(r) may still be discontinuous), 
intermediate conditions on x(r), and state 
variable inequality constraints. 


8-2.2 EFFECTS OF SMALL CHANGES IN 
DESIGN VARIABLES AND PARAM¬ 
ETERS 

The basic idea of the direct method of 
solving optimal design problems is to first 
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construct an estimate u<*>(/), of the 
solution and then find small changes Su(t), 6b 
such that u<o>(r) + 6u(t), + 6 b is an 

improved estimate in some sense. Before the 
improvements can be determined, analysis of 
their effect on the problem must be pei^ 
formed. 

In this analysis, 6 u(t) and 6 b are required 
to be small so that a first order Taylor 
expansion of the functions of the problem is a 
good approximation. Since x(0 is the solution 
of a boundary-value problem involving «(t) 
and b, it is clear that 6 u(t), 6b will cause a 
change 6 x(t) in x(t). It is assumed here that 
the boundary-value problem for x(t) is well 
posed, see Ref. 1, page 227, so that 6 u(t), 6b 
small implies 6jc(t) small. Using this fact. 




du 


I'e-s 


d6x 3/ 3/ 3 / 

“T" = — 5jc + — 6u + — 6 b 
dt 3x du db 


« 33 

** + *." 


(8-7) 


► ( 8 - 8 ) 


3x» 




33. 




f X 

\ 


6 u 


(8-9) 


Ot ““ 1 y j flfld 

* _ 

. 9 - ( 8 - 10 ) 


In all the formulas, Eqs. 8-7 through 8-10, 
the functions are evaluated at (/.x(°>(t),M(®) 
(/), h<°>] where x(®)(0 is the solution of the 
boundary-value problem Eq. 8-2 for x(0 with 
u(0=«<®>(/)andh = ft(‘». 


To simplify the work which follows, it will 
be convenient to eliminate explicit depen¬ 
dence of Eqs. 8-7, 8-9, and 8-10 on 6 x(0. 
This elimination is performed through use of 
the differential equation adjoint to the linear 
equation for 6 x(0 in Eq. 8-8 (Ref. 2). This 
equation is 


dX df 

Tt - 

where the function h(t) will be chosen to 
obUin results needed later in the develop¬ 
ment. 

Note that for any solution X(r) of Eq. 8-11 
and any solution 6 x(t) of Eq. 8-8, 




dt 


+ h^6x + 6x X'^— 6u 
dx du 

*X^-^6b = h^6x + X^^6u 
00 


Integrating this equation from t* to r* and 
using the fundamental theorem of calculus, 
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(/> )5x(t^ ) - X^ (r®)«x(r®) = 

( 8 - 12 ) 

' lh^5x + XJ-«« + Xr dt. 

\ du bh J 

By choosing the function h(t) and the 
boundary conditions on >{t) appropriately, 
the identity Eq. 8-12 will yield the desired 
relationships. First, put h(t)-- 3/© ^/3x 
and define }J(t) as the solution 
of Eq. 8-11 with boundary conditions on 
and determined by 

X''^(r')6x* -X'^V®)«x® 

(8-«3) 

for all fix® and fix* satisfying the second 
equation of Eq. 8-8. To see that the second 
equation, Eq. 8-8, and Eq. 8-13 determine 
conditions on and ), consider the 

following procedure. Determine n of the In 
variables fix®^, fix*^., / = 1,..., n in terms of the 
remaining n of these variables. Now substitute 
the variables fix®^, fix*^. just found into Eq. 
8-13. Eq. 8-13 may now be written as a linear 
combination of n independent fix®^, fix*^. 
Since Eq. 8-13 must hold for all n indepen¬ 
dent variables fix®^, fix*^ previously identi¬ 
fied, the coefficients of all these variables 
must be zero. This is then a system of n 
equations involving only X'^fr®), X*^(r*) and 
known quantities. This procedure will be 
carried out in detail in particular problems. 

Substituting from Eq. 8-12 into Eq. 8-7 
yields 




(8-14) 

3Z» 

Likewise, put h(t) =-^ 

and define 

as the solution of Eq. 8-11 with 
boundary conditions onX^®(/®) and X^*(/*) 
determined by 

X*“^(r»)«jc* -X*“^(/®)6x® 

= -— SxO + — Sx' (8-15) 

dx® dx* 

for all fix® and fix* satisfying the second 
equation of Eq. 8-8. The identity Eq. 8-15 
determined boundary conditions just as Eq. 
8-13 did. Substituting from Eq. 8-12 into Eq. 
8-9 yields 



(8-16) 


In terms of the adjoint variables (f) and 
X “(r), the quantities 6 / and fi^^ are now 
given explicitly as functions of fiu(r) and 6 b. 
The problem is now reduced to determining 
5u(r) and 6 b, which yield the greatest re¬ 
duction in J subject to the linearized con¬ 
straints of the problem. 

It should be noted that the boundary-value 
problems for X*'(r) and X “(r), a = 1, ..., r, 
have solutions if the boundary-value problem 
of Eq. 8-8 is well posed. This is a basic 
property of adjoint boundary-value problems 
which is proved in Ref. 2. 
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8-2.3 A STEEPEST DESCENT APPROACH 

The problem of determining du(t) and db 
to minimize (maximum negative oO must 
deal with the inequality constraints, Eqs. 8-3 
and 8-4. The argument to be used here is: 
simply ignore a constraint function that is 
negative before an iteration begins. If, on the 
other hand, a constraint function is positive, 
it is required to be reduced. For example, if 
> 0 or it) > 0 for some /, then it is 
required that 6^^ = - a = 1, .... r' and 
= 1,.... rand > Oand 

60^(0 = “ /5 = 1, q' and 5«^(/) < 

- /5 = + 1, q and > 0 where 

0 < a < 1 and 0 < c < 1. The magnitude of a 
and c are chosen so that the required changes 
6 ^^ and are not excessively large. If 

and 0^(0 are not so large that the linear 
approximation is violated with a = I, or c = 1, 
then a or c are chosen as one. 


For convenience, define two sets of indices 

A - { o| > o} 

and 

I > o) . 

It should be noted that the collection B(t) of 
indices may change with the variable t. 


Define the column vector of elements 
with 0 


^ = 



(8-17) 


and a similar column vector of functions 
with > 0 


m 




- 

P^BU)- 


(8-18) 


Note that the column vector ^f) may have 
different components at different points in r® 
< t < t*. In order to assure that constraints 
are satisfied, it will be required that 


a= 

S\lf^ < a = r'+ 1, ...,r 

and oiGA 


>(8-19) 


and 

6«^(r) = - c«^(r), |3=1,...,«' \ 

6«j(r) < - c«/r), |S = q' + 1,.... q >(8-20) 
and peBO). I 


Finally, defme 

dfj , 

+T~ ^ <*- 21 ) 

ou ou 



for aU aEA (8-23) 



Note that (t) is a matrix of functions with 
m rows and the same number of columns as 
there are indices in A. The matrix of 
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constants has k rows and the same number of 
columns'as (r). 

Using the matrix notation of Eqs. 8-21 
through 8-24 in Eqs. 8-14, 8-16, and 8-17. 

«/ = £-'^66+ I \^^(t)8udt (8-25) 

= I A*^(t)Sudt. (8-26) 


G= + y^S*^)sb 

+ 7 o W^8b+y^c^ (8-29) 

such that 

bdu 

- M^(/) ^ - 27 o (8-30) 

ou “ 


Before Su(r) and Sb are determined, some 
mechanism must be set up for requiring that 
these variations are actually small. For con¬ 
venience, put 

* Sb^H'^ 66 + j 6 u^»'„(/)5u dr 

(8-27) 

where and are chosen as positive 

definite weighting matrices and cfF is to be 
chosen small enough that Su(f) and Sd are 
sufficiently small. 



= Xo£^^ + 7^£'''^ + 2yoSb^h'^ . 

(8-31) 

In the following development, it will be 
assumed that the problem is normal so that it 
is permissible to put X© = 1. Solving Eqs. 8-30 
and 8-31 for 6 u (0 and Sd, respectively, yields 


The problem is now reduced to finding 
Su(0 and 5b which minimize SJ subject to 
Eqs. 8-19, 8-20, and 8-27. This problem is 
now a special case of the Bolza problem of 
par. 6-4. According to Theorem 6-7, there 


exist multipliers Xo > 


for a > r\ /i(/) • 
q\ and 7 © with 


r«.(')] '■ ^ 


. > 0 


M^(t) > 0 for > 


H 





Su 


— 7o6tt^H'^6M * iJc^ 


(8-28) 


M 0 = -rr- 

^7o 


X A-'(0 + A *(/)7 + 


1 


(8-32) 


and 


W'*" ‘ (e-' + £* 7 ) • (8-33) 

270 ” 


In order to complete the determination of 
6 M(r) and 5b, ii{t) and 7 must be eliminated 
from Eqs. 8-32 and 8-33. A direct analytical 
elimination of M(t) and 7 is not feasible at this 
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point since Theorem 6-7 requires 

7 ^( 6 ^ = 0 

/‘^( 0 I 6 «(f) + c ^(01 =0 

and some components of 5^ and 50(f) + 
c^U) will be zero. A certain amount of logic is 
required to find y and Since only small 
du(t) and 55 are admitted, if 0 and 0 (f) are 
zero for u^®^(fX 5^®^ then very likely they 
will also be zero for M^®^(f) + 5M(f), 5^®^ + 55. 
Following this line of reasoning, it will be 
assumed first that 

50 +fl0 = 0 
and 

50(f) + c 0 (f) = 0 . 

Then y and nit) are determined by substi- 
I tuting 5u(f) and 55 from Eqs. 8-32 and 8-33 
into these equations. The multipliers y and 
m (0 are then determined and checked for the 
proper sign. If 7 ^ > 0 for a> / and > 0 
for P > q\ then this assumption is admissible. 
If, on the other hand, 7 ^ < 0 for some a > r\ 
then 50^ "*■ ~ ^ is incorrect and it must 

be that 50^ + fl0^ < 0 should occur. This is 
equivalent to simply removing 0 ^ from 0 and 
recalculating. Likewise, if yL^(t) < 0 for some 
P > q\ then 50^(f) + fl0^(f) < 0 should occur 
and 0 p(f) should be removed from ^f) and 
the multipliers recalculated. So much for the 
semi-mathematics, now to the calculations 
based on this argument. 

If ^(f) is empty for all f, then 0 is not 
defined and m(^) need not be determined. In 
case ^(f) is not empty, is to be required that 

50 

50 + c 0 = 0 = 5m + c<t> 
bu 

I 
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or 




A.-|i 

\ 3m 3m “ / 


x(\^ + \* y) 

fA.-i 

where / is the identity matrix. 


(8-36) 


Substituting du(t) and 6 b from Eqs. 8-36 
and 8-33 into Eq. 8-26 and then enforcing 5^ 
= -a^. 

It C‘‘ r 

-•^8* (fi-'+J*!)-cl A* 

® JfO 


X w. 


X (A'^ + 7) t/r = - . 

Defining 


(8-37) 




J,. 


(8-38) 


ft' 

^)A*dt 

\ 3m 3m “ / 


(8-39) 


and 

(if., 




( 8 ^ 0 ) 


Eq. 8-37 becomes 

^ + . ( 8 ^ 1 ) 

Since W^(t) is positive definite so is W~\ 
and there is a nonsingular matrix 5 (/) such 
that = s^(t)s(r). By direct multiplica¬ 

tion, it may be verified that 


2*y 

+ ( y^A*^W-^ 


i*y 



X sA*y df > 0. 


( 8 ^ 2 ) 
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Therefore, ^ is at least positive semi- 
definite. In the development that follows, it 
will be assumed that ^ is positive definite 
and, hence, nonsingular. 


Defining 

(/- ^ A*"' K *) 


Eq. 8-41 is now solved for y to obtain 
y = |^27o(4 . 

(8^3) 


It should be noted that if the set of indices A 
is empty, ^ does not exist so ^ is not even 
defined. If, in this case, ^ is defined as 
one and ^ zero, then 7 = 0 in Eqs. 8-41 and 
8-36 reduces, appropriately. In this way, a 
single mathematical analysis holds in all cases. 

Substituting Eq. 8-43 into Eqs. 8-33 and 
8-36 yields 


X A-' + \*M-l (a^ - cM^^) 

,-i - 


(M4) 




-cH'„-‘-^A* <I> (8-45) 


Sb-9*M-\ Af^ ( 8 ^ 6 ) 

and 

8 fr 2 = _ _ cM^^) (8-47) 

the expressions for 6 u(t) and Sb are simply 


Suit) = - —^ Su‘ (r) + 6 m*(/) (8-48) 

2yo 

and 

66 = - 66 * + 66 * . (8-49) 

270 


and 

66 = -:^ wr'e-' 

27o * 


+ 


1 

270 




* 


The variations 6 u(t) and Sb from Eqs. 8-48 
and 8-49 could now be substituted into Eq. 
8-27 to determine 70 . However, since dP has 
no real physical signific«nice, one might just as 
well choose 7 ©. It is interesting to note that 
the terms Su ^(0 and Sb^ are not multiplied 
by an undetermined parameter. Further, note 
that each term in the definitions, Eqs. 8-45 
and 8-47, of these quantities involves 0 and 0. 
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In fact, if 0 and ^ are zero or empty, 6w^(r) = 
6ft^ = 0. It appears that du^(t) and 6ft^ may 
be interpreted as making corrections in con¬ 
straint errors, or keeping constraint functions 
from being violated. Actually, this and more 
is true. 

Theorem 8-1: The following identities 
among 6w‘(r), 6 m^( 0, 6ft^ and 6ft^ of Eqs. 
8-44 through 8-47 hold: 

1. 6ft‘^H'^6ft2 -h I bu^^W^bu^dt^O 


2. 6ft^ + t bu}dt^ — a>lf 


3. £^^6ft‘ + I K^'^bu^dt^Q 


4.4^6u‘=0 

bu 


b(t> 

5. bu^ = — 
dw 


6. - 6ft* - 1 bu dt < 0 . 

Jf® 

By considering the case when 0 and ^ are 
empty, it is clear that in order for bu, bb to 
be in the negative gradient direction of/(i.e., 
bu = — and 6ft = — £•'), 7o > 0 is required. 


The six relationships of Theorem 8-1 give 
the designer an intuitive feel for the Method 

8-nJ 


of Steepest Descent. First, Relation 1 states 
that the changes 6w*(r), 6ft* and bu^{t\bb^ 
are orthogonal. Relations 2 and 5 show that 
6w^(r), 6ft^ provides the requested reduction 
in the constraint functions. Relations 3 and 4 
show, as might be expected due to the 
orthogonality of Relation 1, that 6w*(r), 6ft* 
has no effect on the constraint functions in 0 
and Finally, Relation 6, along with 

Eqs. 8-48 and 8-49, simply states that if ^ = 0 
and = 0 then bu{t\ bb provides a 

reduction in/. 

Before stating a computational algorithm, 
it is important to develop a test for con¬ 
vergence to the solution of the original 
problem. The procedure here will be to show, 
through use of the necessary conditions of 
Chapter 6, that as the solution of the original 
problem is approached, 6 m* (r) and 6ft* must 
approach zero. 

By Theorem 6-7, at the solution of the 
problem, Eqs. 8-1 through 8-4, there are 
multipliers / = 1, .... n, a = 1,.... r. 

and {^(0,13 = 1,.... q such that for 

(8-50) 

and 


O=go+t'^g 

(8-51) 

it is required that 


dcj _ d//^ 

Tt 17“ 

(8-52) 

bH 


II 

o 

(8-53) 

ir-\ aT""'" 

(8-54) 
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a = r'+l,....r (8-55) 

= 0, /3 = ^' + 1,..., q. (8-56) 

Corresponding to the definition of \j/ and 4 > 
in Eqs. 8-17 and 8-18, define v, L,gy and |as 
containing only components of p, L, g, and 
corresponding to elements of ^ and 0. In this 
notation - and due to Eqs. 8-55 and 8-56 — 
Eqs. 8-52, 8-53, and 8-54 become 


duT _ 

r V 
CO 

Vo .T 

+ +p^ 

bL 

(8-57) 

~di 

bx 

bx 

Tx 

r V 

(ji‘ ——. 


-T yT 

p‘ -— 


= 0 

au 

bu 

bu ' 

bu 






(8-58) 


i£L +vrj^ 
bb bb 



bb 



df = 0. 


(8-59) 

Substituting from Eq. 8-11 into Eq. 8-57 
yields 


du dk^ dk* 

- + - +- V 

dt dt dt 


a/"' 3/^ a/’’ - 

- — oj- — y - — k*v 
bx bx bx 


(8-60) 


or 


(w + + X* y) = 

dt 


bf^ 

—-— (w + X"^ + X* v) . 
bx 


(8-61) 


Further conditions from Theorem 6-7 are, 
from Eq. 6*124, 


bG^ 

ax® 

and 


w(r®) = 0 


bG^ 

ax' 


+ «(r') = 0 


(8-62) 


(8-63) 


Multiplying Eqs. 8-62 and 8-63 by 6x® and 
6x‘ yields 


age 

ax® 


6x® +i 


ag 

ax® 


6x®-w^(r®)6x® =0 


and 


ago 

ax* 


6x* + v' 


ag 

ax' 


6x* 


+ w^(r*)«x* = 0. 


(8-64) 


(8-65) 


These equations hold for all fix® and fix'. 
Adding Eqs. 8-64 and 8-65, 


ago 

ax® 


6x® + 


ago 

ax' 


6x* 




- + a;^(/‘ )6x‘ = 0 . (8-66) 

Again, Eq. 8-66 holds for all 6x^ and dx ^. 

Substituting terms from Eqs. 8-13 and 8-15 
into Eq. 8-66, 


X-^^(/')6x' -X-^^(r®)6x® 




(r*)6x*-X'<' (/' 


®)6x®J 


- co^(r®)6x® + w^(r* )6x' = 0 (8-67) 

8-11 
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for all 6x® and dx^ satisfying the second 
equation of Eq. 8-8. By collecting terms, Eq. 
8-67 becomes 

) + v^\* ^(r‘ )]6x> 


Premultiplying the transpose of Eq. 8-70 by 
(b^/du)W~^ yields 

bu du ^ 


- |^w^(r®) + X-^^(r®) + v\* ^(r®)j6x® =0 

1 

1 




bu “ 

\ bu 

bu ) 

(8-68) 




for all and 6 jc‘ satisfying the second 

equation of Eq. 8-8. 

bu “ 

\ bu 



(8-72) 


Eqs. 8-61 and 8-68 constitute the bound¬ 
ary-value problem adjoint to Eq. 8-8, where 
the dependent variable is (co + X*' + X^ P). Due 
to the assumed well posed nature of Eq. 8-2, 
the boundary-value problem of Eq. 8-8 has a 
unique solution for all 6u(r) and db. It is 
shown in Ref. 2, Chapter 4, that in this case 
the adjoint boundary-value problem, Eqs. 
8-61 and 8-68, has a unique null solution, i.e.. 


The coefficient of { in Eq. 8-72 is just A^(/) 
of Eq. 8-34 which is nonsingular. Therefore, 

('''*''*«)] . 

(8-73) 

Substituting Eq. 8-73 into Eq. 8-71, 


w(/) + X-^(r) + X* (r) = 0, r® < t < . 

(8-69) 



Substituting for a>(f) from Eq. 8-69 into 
Eqs. 8-58 and 8-59 yields 



db 




dt-^0 


' ' bu bu bu 

Zt 9^ 

-^=0 (8-70) 


or, in the notation of Eqs. 8-22 and 8-24, 

+ C* p = 0 . (8-74) 

Substituting for { in Eq. 8-73 into Eq. 
8-70, 


ii®. + j;r iL 
bb bb 




a' - A* v+ A® * W * A^ 
bu bu " 


X 


K 

bb 


8/o 

bb 



dr = 0. 



d0 

bu 


W-^ A*p = 0. 


(8-71) 


(8-75) 
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Premultiplying Eq. 8-75 by and 

int^rating yields 

Jf* 


, 3 ^ \ 

A* -:^W-^A')dt 
du du " / 



^W-'A* 


-A*^W-^ A* ‘ 
bu 

X »'-• </r j i? = 0 (8-76) 

Premultiplying Eq. 8-74 by yields, 

^ -1 ^ H'-1 i; = 0 . (8-77) 

Adding Eqs. 8-76 and 8-77 finally yields 


so 


or, 


/ ^ . a|_ \ 

\ du du “ / 

^ = 0 . (8-79) 

Eq. 8-79 is the desired result, 6 m* (t) = 0. 




x[a^ 


Substituting v from Eq. 8-78 into Eq. 8-74 
yields 

8^-e'<'Af;‘Af^^=o 


and this implies, by Eq. 8-46, that 66* = 0 


It is now possible to state a computational 
algorithm employing the results of the pre¬ 
ceding analysis and discussion. 


A Computational Algorithm: 

Step 1. Make an engineering estimate 
of the optimum de¬ 
sign function and parameter. 

Step 2. Solve Eq. 8-2 for correspond¬ 
ing to 


Step 3. Oieck constraints and form ^ and 
^0 of Eqs. 8-17 and 8-18. 




(8-78) 


Substituting v from Eq. 8-78 into Eq. 8-75, 


-A* 


-f 


Step 4. Solve the differential equation 
8-11 with h and the boundary 
conditions of Eqs. 8-13 and 8-15 
to obtain X*^(0 and re¬ 

spectively. 

Step 5. Compute A^(t\ A'^ (r), and 

in Eqs. 8-21 through 8-24 and 
A^(t) in Eq. 8-34. 


+ 


du 


A^ 


80 

du 


H'.: 




Step 6. Choose the correction factors a and 
c in Eqs. 8-19 and 8-20. 
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Step 7. Compute ^ , and ^ in 

Eqs. 8-38, 8-39, and 8 ^ 0 . 

Step 8 . Choose 7 o > 0 and compute y 
and yit) of Eqs. 8-43 and 8-35. If 
any components of 7 with a > r', 
or yit) whh > q\ are negative, 
redeHne ^ and ^t) by deleting 
corresponding terms and return to 
Step 5. 

Step 9. Compute 6 ^^ and 

Sb^ of Eqs. 8-44 through 8 - 47 . 

Step 10. Compute 


= - -r— du^it) 

27o 

+ 6«^(0 

6(i) = 6(0)_ • 5*1 +5*2 

270 

Step 11. If the constraints are satisfied and 
6«*(0 and db' are sufHciently 
small, terminate. Otherwise, return 
to Step 2 with being 

replaced by 


An algorithm of this kind invariably in¬ 
volves a certain amount of computational art. 
The critical element of this algorithm is the 
choice of the parameter 7 © in Step 8 . Once 
the constraints are satisfied to acceptable 
accuracy, du^it) and db^ will be approxi¬ 
mately zero and l/( 27 o)can be viewed as a 
step size in the direction 5^*. In 

this case the change in u(r) and b is 

6«(0 = - duHO 
27o 


6!> = - 



db' . 


8-14 


Substituting these expressions into Eq. 8-25 
and using Eqs. 8-44 and 8-46, 

4' A'V 


where 




With Eq. 8-80 it is possible to request a 
reasonable magnitude for SJ and compute the 
7o which should give this reduction in /. In 
this way, it is possible to choose a reasonable 
7 o. Experience with this method on structural 
design problems, of the kind discussed in the 
following chapter, has indicated that a request 
of 2 % to 10 % reduction in the cost function 
on the first iteration gives a value of 7 © that 
yields convergence. Often, this value of 70 
must be adjusted during the iterative process 
to prevent divergence or to speed con¬ 
vergence. 


This matter of choosing step size in Step 8 
requires a great deal more attention. With a 
little experience one can develop a “feel” for 
how to adjust 7 © to get good convergence, 
even in complex problems. A feasible auto¬ 
matic method of choosing 70 is desirable for 
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use on high-speed computers. No reliable 
method is known to the writer at this time. 


8-3 A STEEPEST DESCENT METHOD FOR 
A GENERAL OPTIMAL DESIGN PROB¬ 
LEM 

8-3.1 THE PROBLEM CONSIDERED 

The basic optimal design problem with 
fixed endpoints, no discontinuities, and no 
intermediate constraints was treated in the 
preceding paragraph. The problem considered 
here will be a generalization of that problem 
to include features such as variable endpoints, 
discontinuities, and intermediate constraints. 
The basic idea of the method of solution will 
be the same as in the preceding paragraph. 
Accounting for the additional features of this 
problem, however, introduces some com¬ 
plexity into the derivation of equations. 

The problem to be treated here is to 
determine «(/), < t < h, and r®, .... 

which minimize 




I fii[t.x{t),U 


subject to the conditions 


(8-81) 





[t,x(r),«(0,ftldr = 0, 


a= 1 


> (8-84) 




^ La \t>x(t)Mt)M dt < 0, 


« = '■'+!. r 


and 


0^(r,w) = O, 

< 0, r® < r < 

1,.... q. 


> (8-85) 


Note that this is just a special case of the 
I^oblem of Def. 6-3. Equality constraints will 
be included in Eqs. 8-84 and 8-85 in a natural 
way during the development. It is assumed 
that for given u{t)jbj^y and the boundary- 
value problem, Eq. 8-82, has a continuous 
solution x(j). If constraints of the form 
<jj(t,x,u,b) < 0 occur, they may be replaced 
by a constraint of the form 


dx 3 

— f(t.x,u,b), 
at 


d,(t®,x«,t'',x'') = 0,j= 1. n 


>(8-82) 


/ 


ii'(t',x') = 0,/= 1.1? (8-83) 


•,n 

I jw[f,x(0.«(0,61 

( 8 - 86 ) 

+ |w[/.x(t).«(/),*]|} dr = 0. 


Constraints of the form of Eq. 8-85 are easily 
treated in a direct manner so they need not be 
reduced to the form of Eq. 8-86. 
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Just as in the method of par. 8-2, the 
idea here will be to estimate 
and r®, and then to allow small 

changes du(t) and db. The object is to 
determine 5u(/) and Sb which yield the 
greatest reduction in J and which satisfy 
the constraints. 


where Ajc^ is the total change in at the 
point Since x(t) is to be continuous before 
and after the variation, the total change in 
x(t) must be continuous at each point t. This 
requires that 

Sx^it^ - 0) - 0)6/^ = Ajc^ 


8-3.2 THE EFFECT OF SMALL CHANGES 
IN DESIGN VARIABLES AND 
PARAMETERS 

Before the optimum changes in u(t) and b 
may be determined, the effect of these 
changes on / and must be assessed. Since/ 
and \p^ have the same form, the expressions 
for change of a general functional 

f'” 

Q = g(b.t>.xl) + \ 1 dt 

J f* 

(8-87) 


= bx^it^ + 0 ) +/(/' + 0 ) 6 /^ ( 8 - 89 ) 

I = 0,1,..., 17 , where 6x(0 and are 
indpendent changes in x(0 and 

The independent variation in x(0, 5x(Of is 
related to 6u(0 and 6b through the varia¬ 
tional equation 


d 3/ 3f 3f 

— (6jc)=^6jc+^6u+^6^ 
dt 3x 3u 3b 


< t < 


(8-90) 


will be determined and the result will be 
applied to / and . 


The boundary conditions, Eq. 8-82, require 
that 


Expanding Q to Hrst-order terms in the 
variables u(t\b,^, and x^, yields 


36 36, 36, 

_L ^0 +—+— 

doc® 3/® 


Ax® 


6e=i|5ft+|^Ax®+...+^Ax/ (8-88) 


+... +1^ Ax® +-^ 6r® +... +-^ htf 
dx® dr® y 


+... +-^ ar® - F(r® + 0)«r® +... 
dr® 

+ {F(tl - 0) - F(tl + 0)] ht! +... 

+ F(r® -0)«r® 


+ 



dF , dF , dF 

dx dM bb ) 


dt. 


be, 

+—6r®=0 (8-91) 


5=1, .... n. Finally, the relations of Eq. 8-83 
require that 


dn' ^ , dft' ^ „ 

-Ax'+-6r' = 0, 

dx' dr' 


/ = 0,1,...,1I. (8-92) 

It is clear that the variations 6x(r), 6u(t), 6b, 
5/^, and Ax^ are not all indpendent. 

In order to express 6Q in terms of only 
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6u(0 and S3 which are to be determined, 
introduce the adjoint variable X(r) just as in 
Eq. 8-91, 


X- — 

dt dx dx 


(8-93) 


Integrating the identity below Eq. 8-91 from 
to ^, 0 < /,/ + 1 < 17 , one obtains, just as 
in Eq. 8-92, 


-0)6jc(r^'"^ -0) 
- + 0)6jc(r^ + 0) 


J,/ \ du 


+ dt- (8-94) 

Note that the boundary conditions on X(r) 
and the properties of X(r) at the points / = 
1 , ..., 1 ? - 1 , have not yet been specified. 
These boundary and intermediate conditions 
on X(r) will be the m^'or output of this 
subparagraph. 


Summing all the formulas, Eq. 8-94, over / 
= 0 , 1 ,..., 17 — 1 , one obtains 


^ bF 

— Sxdt = -\^ (r® + 0)6jc(r® + 0) 

^0 bx 

+... + X^(r^ - 0)6jc(r^ - 0) 

- X^(r^ + 0)6jc(r^ + 0) +... (8-95) 

+ X^(r^ -0)6jc(r^ -0) 




) 


Or, using the definition of /ix‘, this is 
dF - 

\ — Sxdt = - X^(r® + 0) Ax® 

+ X^(r® + 0)/(r® + 0)6r® 

+... + [X^(r^ - 0) 

- X^(r^ + 0)1 Ax> 

- [X^(r^ - 0)/(r/ - 0) 
-X^(r^ + 0)/(f^ + 0)16r/ 

+ ... + X^(r’' -0)Ax® 

- X^(r® - O)^^ - 0)«r’' 



+ X^ Sb)dt. 
db f 


Substituting from Eq. 8-95 into Eq. 8 - 88 , 
yields 

+1^-^ - x^(f^ - 0) + + o)j Ax^ 

I'ii 

- X^(r» + 0)/(f® + 0)j «r« 
+F(r^-0)-F(r^ + 0) 
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dg 

+ ...+ +F(r^ -0) 


-X^(// + 0)/(/^ + 0)J St^ 

+ x^(t^ -0)/(/^ -0)J«r’» 

•'f® 

(8-96) 

The quantities Ajc^ and 5/^ appearing in Eq. 
8-96 are required to satisfy the conditions of 
Eqs. 8-91 and 8-92. The objective now is to 
choose the boundary and intermediate condi¬ 
tions on X(r) so that Eq. 8-96 is independent 
of Ax^ and 6t^; i.e., so that 


+ \^(t^ - 0)/(r^ - 0)J dt^ = 0 

(8-97) 

for all Ax^ and 6t^ satisfying Eqs. 8-90 and 
8-92. 

In order to determine conditions on X(r^ 
based on Eq. 8-97, a Lemma is required. 

Lemma 8-1: For A, / = 1, ...» m < n in 
/?" if i4^jc = 0 for all jc in such that 

Bfx-Oy /= 1,..., m 

then there exist constants such that 

m 

A^x-^Z^ ufiJx^O (8-98) 

for allx in/?^. 

For proof of this Lemma see Ref. 3, page 

12 . 


+ 1^-^ - - 0) + X^(r^ + 0)j Axf 

- X^(r» + 0)/(r® + 0)j 6r» 

- F(tl + 0) + X^(f^ - 0)f{tl - 0) 

-X^(f^+0)/(f^ + 0)j«r^ 

8-18 


Consider the expression of Eq. 8-97 as 
A^Xy where the components of A depend on 
the values of X(r^) and x = (Ax®^, ..., Ax®^, 
6/®, ...» 6/^)^. The equalities X = 0 are just 
Eqs. 8-91 and 8-92. Denoting the multipliers 
cj/, as = 1 , ..., rty and = 0, 1,..., 77 , Eq. 
8-98 becomes 



+ X^(r»+0)+ 2 T,-^ 


1 


dx® 


3«®1 

ax®J 

•4-^ 


Ax® 

- - 0 ) + + 0 ) 
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+ Z T + y -^1 Ax’* 

*«i * dx^ ” dx" J 

1^-^ - F(r» + 0) - x^(f» + 0)/(r» + 0) 

\.W 

+ - QDf{t> - 0) 

- \^(tl + Qi)f{tf + 0) + 7, j 

n 

+ - om/" - 0) + S t, ^ 


+ ... + 


Ic „ n 

+7„-rr =0 

" df** 


- 6^1 = ( 


(8-99) 


for all Axf,dt^, / = 0,1,..., i?. Therefore, 




an* „ 


( 8 - 100 ) 


3«/ 


- X^(r/ - 0) + X^(r/ + 0)+7, = 0 

( 8 - 101 ) 


Bg ^ "Be 

-x^(r’' -0)+ 2 T.-^ 

dx” »=i * doc" 

aft" „ 

+ 7 - * 0 

^ ax" 


( 8 - 102 ) 


- F(r® + 0) - X^(r® + 0)/(r® + 0) 

+ z T. 4^ + 70 =0 (8-103) 


»=I » ar® 


ar® 


9« 


ar- 


+ F(r^-0)-f=i(r^-0) 

+ X^(r/ - 0)/(r^ - 0) 
- X^(r^ + 0)/(r/ + 0) 


an. 




= 0 


(8-104) 


+ F(r’' - 0) + X^(r® - 0)/(r’' - 0) 
.Zr,4^.7,^ = 0. (8-105) 


The object is now to eliminate the and 
in order to obtain explicit conditions on X(r). 
Postmultiplying Eq. 8-100 by /(r® + 0) and 
adding Eq. 8-103 yields 


n 

z 

»•! 

+ 7 on®(r® + 0 ) = 0 


i(r® + 0) - F(r® + 0) + Z T,tf,(r® + 0) 

(8-106) 


where 


38 


+ -^ /Ir' ± 0, x(r' ± 0), «(/' ± 0), *1 
Bx‘ 

(8-107) 
8-19 
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d/r' ± 0) = 


dt‘ 


dd 


+ ^ flt‘ ± 0 , x(t‘ ± 0 ). Uit^ ± 0)^1 
dx‘ 

(8-108) 


and 

+ /(/' ± 0,xil^ ± 0), Uit* ±ow. 

dx‘ 

(8-109) 

Since S2®(r®.x®) = 0 is to determine r®, the 
total derivative with respect to r®, + 0 ), 

should not be zero. Therefore, 7 © niay be 
determined as 


1 


ft»(r«+0) 

X [i(r® + 0) - F(r« + 0) 


n 

■ 2 T^.(r® + 0) . (8-1 

1*1 ' ' 


10 ) 


Substituting Eq. 8-110 into Eq. 8-100 yields 

igr n 

X(r®+0) = — * 


ax® 


- 2 T. 


ax® 


1 r. 

n®(r® -► 0 ) 


i(r®+0) 


- F(r® + 0) 

n 1 0^0 r 

+ J^r,9,(r®+0)J-^- (8-111) 

This is then a boundary condition on \(t) at 

r®. 

In exactly the same way, postmultiplying 
8-20 


Eq. 8-102 by - 0) and adding to Eq. 
8-105 yields 

i(r® - 0) +F(f® - 0) + z 
1*1 

+ 7^^i’'(/’» -0) = 0. 

Solving for and substituting into Eq. 8-102 
yields 


ag^ " 

X(/®-0) = ^ + 2 T, 
ax® *“i ' 


dej 

ax® 


1 

fi'id” - 0) 


|^i(r® -0)+F(t® -0) 

+ rAa®-0)j 


T 


( 8 - 112 ) 

where g - 0 ) and 6^ - 0 ) are deflned in 

Eqs. 8-107 and 8-108. Eq. 8-112 then serves 
as a boundary condition on X(r) at 


In order to use Eqs. 8-111 and 8-112 as 
explicit conditions on X(/) at r® and , the 
parameters must be eliminated. This may 
be accomplished by alegbraic manipulation in 
particular cases. To illustrate this idea on a 
problem which has been treated extensively in 
the literature (Refs. 5,7,8,9), consider the 
case in which a full set of initial conditions 
is given, i.e., 


ei,(r®,x®) = 0, 5 =l,...,n. (8-113) 

In this case, Eq. 8-112 yields 

X(/® - 0) _ 0 ) 


X ri(/® - 0)+fXf® - 0)1 • 

L -I 9JC® (8.114) 
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Eq. 8-111, on the other hand, is just a vector 
equation with n components which deter¬ 
mines ri,...,r^. It gives no explicit informa¬ 
tion on X(/® + 0). 

Now the boundary conditions on X(r) have 
been determined. It remains,however, to de¬ 
termine jump conditions on X(r) at the 
intermediate points tL Postmultiplying Eq. 
8-101 by/(r^ ± 0) and adding to Eq. 8-104 
yields 

g{tf ± 0 ) + Fit^ - 0 ) - F{tf + 0 ) 


Htt + 0 ) - - 0 ) = - — ( 8 - 116 ) 

bxf 

+ -r-^ - UV± 0 ) + F(r^- 0 ) 

Sl/(tf± 0 ) \ 

- Fitf + 0) 

- - 0) [fit* ± - 0)1 

+ 0) [f(t^ ± 0) - f(tf + 0)] 



- - 0) ± 0) -f{ti - 0)] 

+ + 0 ) lfit‘ ± 0 ) - fit> + 0)1 

+ ±0)=‘0 (8-115) 

where the notation of Eqs. 8-107 through 
8-109 has been used. The choice of limit from 
the right or left (the plus or minus sign, 
respectively) in Eq. 8-115 is left open for 
now. One or the other alternative will be 
chosen for computational convenience. 

It is assumed that the condition = 

0 determines as a function of so it is 
required that the total derivative of with 
respect to ± 0), not be zero. 

Therefore, from Eq. 8-115, 

T> = --r—^- ji(r/±0) + F(f^-0) 

- F(tl + 0) 


This equation, the boundary conditions at 
Eqs. 8-111 and 8-114, and the differential 
equation, Eq. 8-93, are to determine the 
adjoint variable, X(r), r® < r < r^. The 
boundary and intermediate conditions on X(r) 
were constructed so that Eq. 8-97 holds and 
in turn, Eq. 8-96 becomes 



+ 



dt 


or 


- X^(r/ - 0) w ± 0) -fitt - 0)1 

+ + 0 ) ± 0 ) - fit! + 0)11 . 

Substituting this expression into Eq. 8-101, 


This equation meets the objective of this 
subparagraph, namely, determination of the 
dependence of bQ on bb and buit) explicitly. 
Since Q was any functional, this result can be 
applied to the particular functionals of the 
present problem, J and . To obtain bJ and 
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6^^, define X*^(r) and as the solution 

of Eqs. 8-93, 8-111, 8-114, and 8-116 with 

g = ^0 and ^ = /o ^ot \^(t) ( 8 - 118 ) 

and 

g = and F = for X'^“(0. (8-119) 

In this notation, Eq. 8-117 yields 



dt (8-122) 


and 




bu 


X'^«(0. (8-123) 



In this notation 


6/ = £*' 6f>+\ A*' (t)bu{t)dt (8-124) 


and 


6^ * £'<'« 


r 

r 

' 66 + 1 
Jf* 


A*® (t)hu{t)dt . 

(8-125) 


and 




I 3* 3 ,. L 

-'■'<'> If] 


dt ) 66 


3/ 


+ X’'“^(0 


du 


6u(r)dr. 


For a more compact notation define 

.1 




f [f 




( 8 - 120 ) 

( 8 - 121 ) 


The problem of this paragraph is now in 
approximately the same state as the problem 
of par. 8-2 was in Eqs. 8-14 and 8-16. Before 
proceeding to derive a steepest descent al¬ 
gorithm, however, several comments are in 
order. 

First, the choice of limit from the right or 
left was not made in Eq. 8-116. This choice is 
generally made depending on the distribution 
of boundary conditions on the state variable. 
If most of the boundary conditions on x(r) 
are given at t®, for example, then most of the 
boundary conditions on X(r) will be given at 
Since the adjoint equations are linear, 
superposition techniques may be used to solve 
the boundary-value problem. These tech¬ 
niques involve several integrations of Eq. 8-93 
from to r® with different starting con¬ 
ditions at . These integrations must account 
for the jump condition, Eq. 8-116. The 
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integration then proceeds from the right and 
Eq. 8-116 should then be used to determine 
X(r^ - 0 ) in terms of X(r^ + 0 ) so that the 
integration may continue. For this reason, the 
minus sign is chosen in Eq. 8-116 so that 

X(f/ - 0) = w + 0) + — 
dxf 


1 

- 0 ) 


- 0 ) 


# 0 . 

One might argue that the function x(t) is 
close to the actual state and examine the 
effect on f 2 ^[Kjc{r^)] of altering i.e., 

SSlf[tf,x{th] = — St> 

dx dt 

(8-127) 


+ F(tl - 0) - Fit' + 0) 

+ X(r^ + 0)[/(r^-0) 

T 

-/(r^ + O))-^. (8-126) 
hx! 

Since the state equations have previously been 
integrated, - 0 ) and - 0 ) can be 
computed in Eqs. 8-107 and 8-109. 

The second matter that requires discussion 
is the determination of and its variation, / = 
0,1, ..., 17 . If the state equations form an 
initial-value problem (all initial conditions 
given) then one can make an estimate for u(t) 
and b and integrate Eq. 8-93 from toward 
(or toward r® if all boundary conditions 
are given at ). As the integration progresses, 
can be monitored and the value of t 
for which it is zero is called tL The situation is 
not so easy in case the state equations form a 
boundary-value problem. 


where the plus or minus sign is chosen 
depending on whether should be positive 
or negative to make f 2 ^(r^ + r^), x{t^ + 1^) = 0 . 
The change bt^ is then chosen and if it is not 
too large, the state equations need not be 
re-integrated. This argument corresponds to a 
Newton-type algorithm for the determination 
of tL This procedure should be used after 
every variation in u(t) and b and subsequent 
integration of the state equations, since x(t) 
will be altered with an accompanying altera¬ 
tion in tL 

8-3.3 A STEEPEST DESCENT COMPUTA¬ 
TIONAL ALGORITHM 

The problem of determining 6 M(r) and bb 
which reduce J and satisfy other constraints 
will now be solved just as in par. 8-2. As in 
the preceding paragraph, if some 0 ^[/,u(r)] or 
is less than zero, it will be ignored. If, on 
the other hand, > 0 or 0 ^[/.w(/)] > 0 , 
then it will be required that 


One method of determining requires that 
a reasonable estimate of be available, 
perhaps from engineering intuition or prelim¬ 
inary analysis. The state equations are then 
integrated using the engineering estimates for 
u{t) and b. It is likely that for the solution 
x(t\ 




and 

5 ^, = - 

where 0 < a < 1 and 0< c< 1 . 
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Just as in par. 8-2, define two sets of 
indices 


y4 = {al^„[x<®>,«(«>,*<»>] > O) 
and 

5(0= > o} 


where and are the beginning 

estimate of the design variable and design 
parameter, respectively, and is the 

associated solution of the state equations. 
Further, define the column vectors of con¬ 
straint functions 



(8-128) 


and 


0(0 = 


_ . 


(8-129) 


and 


\Ut) = 


r A*“(o 1 


(8-133) 


L a^A -> 

That is, the columns of 8* and A* (r) are 8*“ 
and A*® (t) for those a with > 0. Now, 

dJ = + j (t)bu(t)dt (8-134) 


and 


hi = 9* 


T 

6b + 



T 

(t)bu(t)dt . 


(8-135) 


The problem of this paragraph is now to 
find bu{t) and 6b to minimize 6/, subject to 
Eqs. 8-130 and 8-131. Although the symbols 
have a slightly different origin, this problem is 
precisely the same as that given by Eqs. 8-19, 
8-20, and 8-25 of par. 8-2. All the analysis 
required to determine bu(t) and 6b follows 
and Theorem 8-1 holds. The only difference is 
that in Theorem 8-1 must be interpreted as 
in the present problem. 


By the argument of par. 8-2, it will be 
required that 

60 = —a0 (8-130) 

and 

60(/) = -c0(r). (8-131) 


The algorithm of par. 8-2 may now be 
given with references to equations of this 
paragraph. 

Algorithm: 

Step 1. Make an engineering estimate 
b^®^ of the optimum de¬ 
sign function and parameter. 


Using the notation of Eqs. 8-122 and Step 2. Estimate t\ t\ ..., and solve 

8-123, define the matrices for x®(r). 
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Step 3. Adjust as required by the discus¬ 
sion below Eq. 8-127 and recom- 
(8-132) pute jc® (r) if required. 
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Step 4. Check constraints and form ip 
and 0 (/)of Eqs. 8-128 and 8-129. 

Step 5. Solve the differential equation, Eq. 
8-93, with boundary and inter¬ 
mediate conditions of Eqs. 8-111, 
8-114, and 8-116. The solutions 
corresponding to the functions, 
Eqs. 8-118 and 8-119, yield X ^(0 
and respectively. 

Step 6 . Compute aHO, , A* (r), and 
A*(0, in Eqs. 8-120, 8-121,8-132, 
8-133, and 8-34, respectively. 

Step 7. Choose the correction factors a 
and c in Eqs. 8-130 and 8-131. 

Step 8 . Compute ^, and in 

Eqs. 8-38, 8-39, and 8-40. 

Step 9. Choose To > 0 and compute t and 
ix(t) of Eqs. 8-43 and 8-35. If any 
components of t, with a > r\ or 
m( 0 , with 0 > ^ 7 ' are negative, 
redefine ip and 0(0 by deleting 
corresponding terms and return to 
Step 6 . 

Step. 10. Compute 

and db^ of Eqs. 8-44 through 
8-47. 

Step. 11. Compute 

2To 

+ 6w2(0 

7 ,( 1 ) = 7 ,( 0 )- L 57,1 + 57,2 

2To 

Step 12. If the constraints are satisfied and 
and 6 b ‘ are sufficiently 


small, terminate. Otherwise, pro¬ 
ceed to Step 13. 

Step 13. Adjust r®, r*,.... as required by 
the discussion below Eq. 8-127. 
Return to Step 2 with 
being replaced by and 7 ,( 1 )^ 

For an alternate development of the al¬ 
gorithm in the special case of a full set of 
initial conditions, see Refs. 5 and 7. Several 
example problems are solved in Ref. 5. 


8-4 STEEPEST DESCENT PROGRAMMING 
FOR A CLASS OF SYSTEMS DE¬ 
SCRIBED BY PARTIAL DIFFEREN¬ 
TIAL EQUATIONS 

8-4.1 THE CLASS OF PROBLEMS CON¬ 
SIDERED 

Thus far, all problems considered have had 
their state variable specified by algebraic 
equations or boundary-value problems in 
ordinary differential equations. It is possible, 
however, that the state of the system being 
considered is governed by a boundary-value 
problem with partial differential equations. In 
such cases, the state and design variables are 
functions of more than one independent 
variable. One may then think of the design 
variable as being distributed over an area, 
volume, or higher dimensional space. For this 
reason, such problems have been described as 
distributed parameter systems. 

A great deal of work has been done on 
distributed parameter systems which have a 
time-like variable (Refs. 12,13); i.e., a variable 
which makes the governing differential equa¬ 
tion hyperbolic or parabolic. In this para¬ 
graph, consideration will be limited to static 
problems such as equilibrium of plates, shells, 
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etc. These problems are described by linear, 
elliptic, partial differential equations (Ref. 1). 

The boundary-value problem treated will 
be denoted 

L{M,b)[z] =G(JC.«,b),Jc€n (8-136) 

B (V, b)[2 ] = ^7 (jc, V, b), JcGr (8-137) 


|a| = ai + ...+aj^, |/3| = /3i + ...+ 

The object of the problem is to determine 
u(x\ x&l, v(jc). jc€r, and b such that 

/ g^(x,z,v,b)dT 
r 

+ // fo(x.z.u.b)dil (8-140) 


where x = (xi,jc 2 ,...,jCj^ )^ is the independent 
variable which ranges over the domain in 
R* with boundary T. The vector uix) = 
[«i(jc), ..., u^(jc)]^ is the design variable 
over v(jc) = [Vi(jc),..., v^(jc)l ^ is the design 
variable over V (boundary design), and b = 
(bi,...» b^)^ is the design parameter. The state 
variable z(x) = [zi(x), ...» z^(x)]^ is to be 
determined by the boundary-value problem, 
Eqs. 8-136 and 8-137, which is linear once u, 
V, and b are specified. It is important to note, 
however, that the problem depends in a 
nonlinear way on u, v, and b. 

An example of the form of the differential 
operators L(u,b)[z] andR(v,b)[z] is 


is a minimum subject to the constraints of 
Eqs. 8-136 and 8-137, 

/ g^{x, 2 ,v,b)dr 

r “ 

+ // LAx.z,u,b)dSl = 0, 

n 

S g„ix,z.v.b)dr 
+ jy L^(x,z,u.b)dSl < 0, 

a = r'+l .r 


1 (8-141) 


L(u,b)[z] = Z a {x.u.b) 
l<»l<n 


a lair (8-138) 

X ■■ - . -r—j 


djc"‘ ... 


and 


B{y,b){z'\ = Z b^(x,v,b) 


dW\z 


-,xer 


(8-139) 


... 9x][* 

where 

a = (o,, ...,o^)^,|3 = 0J,. 
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♦,(ac,«) = 0, xeSl, /= 
«,(x.«)< 0,xe«, / = t 


(8-142) 


and 


cjy(x,v)*o, xer. /=!,..., f' 
coy(X, v) < 0, xer, / = f' + 1,..., f. 


(8-143) 


/ 


The method of solving this problem will be 
similar to the methods of pars. 8-2 and 8-3. 
An estimate «l®>(jc), vl®>(jc), and will be 
made and changes sought which reduce /, 
subject to the constraints of the problem. 
Before desirable changes in and 

may be determined, of course, their 
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effect on function in the problem must be 
examined. 


8-4^ EFFECT OF SMALL CHANGES IN 
DESIGN VARIABLES AND 
PARAMETERS 

It will be assumed in the following that the 
boundary-value problem, Eqs. 8-136 and 
8-137, is well-behaved in the sense that small 
changes 6 m in 6v in and 66 in 6^®^ 
yield a new solution + 6z (where z^®^ is 
the solution corresponding to the estimated 
design functions and parameters), where 6z is 
small. 

To first order, 6z must satisfy the lin¬ 
earized boundary-value problem. 


for jc€r, where 

A^Hx.u,b)iz] =^L{x.u.b)[z] 
“ du 

A.L(x,u,b)lz] =-^L(x.u,b)[z] 
at) 

A B(x,v,b)lz] =^B(x,v,b)lz] 
^ av 

A^B(x,v,b)lz] =-^B(x.v,b)lz] 


\ 


} (8-146) 


/ 


For convenience in the following develop¬ 
ment, the arguments of L and B will always 
be taken as m^®\ and 6^®^ 

The functionals J and are of the same 
general form, so, for their analysis define 


6<®>) (6z) + [r<»>] 6u 

+ [z<»>] Sb 


P= S g(.x,z.v,b)dr + SS F(x,z,u,b)dSl. 

'' " (8-147) 


bu 


a* (8-144) 


Once the dependence of P on changes in m, v 
and b is determined, the result may be applied 
directly to J and . 

To first order terms, 


for jc€n and 

B (V<®>,6<®>1 («z) + [vW 6<®>) (z<®>] 6v 

+ Aj5[v<®>,a<®>)[z<®>]«a 

(jt, >^®\ a*®^ ] 6 V 
dv 

+-^ (x,v<®>.6<®>]6a 



(8-148) 


In order to make use of Eq. 8-148 in the 
determination of 6m, 6v, and 66, it is desirable 
to eliminate explicit dependence on 6z. This 
is done through use of the adjoint operator 
L* defined by 
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jJix^i(M.fc)(6z] -6z^Z-*(M,ft)(X)} dSl 
= jAl\]^ClSz]dr (8-149) 


where >4 [X] and C[6z] are differential opera¬ 
tors. The form of the operators A, C, and L* 
is determined by integrating X^L(M,h)[6zJ by 
parts. 

Putting 


bF^ 

i:W)[X] = — 
bz 


(8-150) 

Eq. 8-148 becomes 

SP= J'|-^6z-yl(X^)C(6zJ+-^6v 

3 «»} «r 


dv 




bF 


+ I I ( X^I(«,fe)(6zI + — 5u 
bu 






Substituting from Eq. 8-144 for I(m,6)[6zJ, 
this is 




- X^A„I(u,b)[z] 

.xr^lsu 

bu I 


+ {-^ - X^AjI(«,b)[z] 


db 


+ X^—^}Sb]dS2 


._^j 

db I 


dg 


A[\J^Cl6zl^ — dv 
bv 


dg 

+ ^6b dr, 

bb 


(8-151) 


The objective now is to eliminate explicit 
dependence of bP on bz. This may be done by 
requiring that 

^g 

-gj 6z->4[X]^C[6z] (8-152) 

be explicitly independent of 6z for all bz 
satisfying Eq. 8-145. This may be interpreted 
as requiring that on F certain components of 
6z be determined from Eq. 8-145 in terms of 
bv, bb, and the remaining 6z. The coefficients 
of all components of 6z remaining in Eq. 
8-151 must then be set equal to zero. These 
equations will then yield boundary conditions 
for X(x). 

Assuming all this calculation has been 
completed and X(x) determined, Eq. 8-151 
may be written as 

= ff A^(x)bu dn-h f n^(x)bv dr 
ft r 

where 


dO^ 

A(x) = -— -AL(u,b)lzJX + -^X 
ou “ dy 

n(x) = coefficient of 5v in Eq. 8-151 after 
substitution 



- A^L(w,h)[z]X 


+ 



dn 


+ / [coefficient of bb in Eq. 8-151 
^ after substitution ] dr 


(8-153) 
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By putting 

^=^0, F^fo (8-154) 

and 

one obtains 

&/= SS\^^{x)budQ.+ s Ti'^(x)dvda 

SI r 


^ = 


-K - 

-aGyl- 


where 

A = (a \ \l/^> 0). 


It will be required that 

6^= - Cl ^ 


(8-158) 


(8-159) 


(8-160) 


+ !i'^db (8-156) 
and 

. T 
SI 

T T 

+ Xn*“ ‘(x)5v£(r + e*“ Sb 

r 

(8-157) 

respectively. 

The expressions, Eqs. 8-156 and 8-157, give 
the desired explicit dependence of dJ and 
on 6w, 6v, and db. The problem is now 
reduced to determination of du, 6v, and db 
which give the greatest reduction in J subject 
to the constraints of the problem. 

84.3 A STEEPEST DESCENT COMPUTA¬ 
TIONAL ALGORITHM 


where Ci is a constant between zero and one. 
The idea here is to drive ib toward zero if a 

a 

constraint is violated or will be violated by a 
change in the design variables or parameters. 

For convenience in later development, 
define 

A'^(x) = { A*'^ ;aeA) \ 

n*(x)=( n'*'-; ae^) > ( 8 - 161 ) 

i* =i ; aeA ) . / 

In this notation, 

fS A* (xf bu dSl 

SI 

+ Sn*^{x)Svdr + i*^bb . (8-162) 


The procedure will now be to choose 6 m, 
6v, and db so as to minimize dJ subject to the 
constraints Eqs. 8-141 through 8-143, just as 
in pars. 8-1 through 8-3. In order to insure 
Eq. 8-141, define 


Likewise, define 




L/gz)(x)J 


(8-163) 
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and 


w(x) = 


- w, - 
J^E(x). 


where 

D{x) = (/1 0,(x) > 01 


dP^= 

n 


(8-164) 


(8-165) 


+ /6v^li/^6vdr + 66^H'j66 

r 

(8-171) 

where dP is “small”. The choice of dP will be 
discussed later. 


and 

E(x)^[i\ 0] . (8-166) 

It will be required that 
60(x) < - C2 0(x), xeO. (8-167) 

and 

dcjix) < - C3cj(x), xer (8-168) 
where 


The design variables and parameter, du, Bv 
and Bby are now to be chosen to minimize 5/ 
of Eq. 8-156 subject to Eqs. 8-160, 8-167, 
8-168, and 8-171. 

A multiplier rule of Liustemik and 
Sobolev, Ref. 14, page 209, will now be 
applied to the present problem. It guarantees 
the existence of multipliers, m(-^), x€l2, pt^(x) 

> 0, / > K->c), Jcsr, VfM > 0,j> f*. y, y^ 

> 0, a > r', Xo > 0, and 70 such that 

6(67) = 0 (8-172) 


0 < C2 < 1 and 0 < C3 < 1 

60(x) =-^ (x)6u(x) 
du 


and 


for all 6u, 6v and 66, where 


(8-169) 


6 / 


■//[- 


XoA-'^(jt) - 7^A* ^(x) 


6w(x) = -— (x)6v(x). (8-170) 

3v 

Before determining 5v, and 5b, a device 
should be introduced to insure that these 
quantities are small as is required in order that 
the preceding first order analysis is a good 
approximation to reality. The engineer should 
choose positive definite weighting matrices 
H'j,(x), and so as to associate a 
relative importance to all the variables. It is 
then required that 


8-30 


/[- 


- M^(x) -^l6udl2 
x®n''^(x) - 7^n* ^(x) 


-7o6v^H', 

- v'^ix) 4^16vdr 

ov J 

+ Xo?'^ - 7^fi* ^ - To W H'j66. 

(8-173) 
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Using hJ of Eq. 8-173 in Eq. 8-172 




- 7 ^ A* (x) 
- 27 o5«^»'„ 


d€l 


At this point it is assumed that the problem 
is normal so that X© = 1 may be chosen. Eqs. 
8-175 through 8-177 yield 



X 


- A-'Cx) - A* (xh - 

du 



xen (8-178) 


-7^n'<'^(x)-27o6v^H'^ 

-•'^(x)4^1s*vdr 
3v J 

+ [^- Xoe'^ - 'r^i* 

- 27 o 6 ft»'j j hH , 

(8-174) 

for all 6*«(3t), xef2, 6*v(x), xsr, and 6*ft. 
This implies 

- Xo-'fx) - A* (x) — 27 o ft'u Su(x) 
di^ 

#i(x) = 0 (8-175) 


6v(x) - Wy *(x) 


X 1^- n‘^(x) -nl'y 

1 

--g^Kx) ,xer (8-179) 


and 


W-' (- -S*y). (8-180) 


Assume for the present that Eqs, 8-167 and 
8-168 are equalities. Substituting Eqs. 8-178 
and 8-179 into Eqs. 8-167 and 8-168 yields 



30 

du 


^-1 


^-A*' - A *7 



forx€£2 


- Ci^, xen 


- Xon'^(x) - n* (x )7 - 2yo Sv(x) 


dcj^ 

- —- Kx) = 0 (8-176) 

dv 

for xEF, and 

- 7 - 27oH'^^ = 0 . (8-177) 


and 


•')=-c.<5. »er. 
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Since and R'”* are positive definite, the 
matrices (b^lbu) W~^ (b^^lbu) and (bcbibv) 
M/”* {bC/jbv) are positive semi-definite. It 
will be assumed that they are positive definite 
and hence nonsingular. In case 0 or ci is 
empty, then the terms multiplying // and v do 
not exist. In this case simply define // = = 0 
and 00/dw) (b^'^lbu) = (d<i/dv) 

(bcb^Ibv) = 1. In any case. 


fiix) =A^ * \2yoC24> 

L dw “ 

X (- A-' - A* 7 )j , xesi 

where 


(8-181) 




bu 


and 

»<x) = A“"* |^27 oC 3W + -^ h'7' (-n^ 
-fi’^7 )j,xer (8-182) 

where 



w;' 


bCb^ 

bv 


X (-n-'-n*7) 


-1 1 

—g^A" 270C3W Lxer. 


(8-184) 


In order to determine 7, these expressions are 
substituted into Eq. 8-160. Using Eq. 8-182, 
the resulting equation is 


=-Cj (8-185) 


Substituting from Eqs. 8-181 and 8-182 into 
Eqs. 8-178 and 8-179 yields 




■) 


where 


M 


ifJ 







A^dSl 


X (- A-' - A* 7) 



1 


2yuC2^\x^Q. 




-1 


X 


bCi 

bv 



n-'dr 


(8-183) 




(8-186) 
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«..= A.- 

n 

’“'E A*<® 

*/n*V'('-v A*-' 

r 

dv " / 






(8-187) 

(8-188) 


and 


f T 

3f = y 


A**' CO c/r. 


(8-189) 


It was shown in par. 8-2 that the matrices 
in the integrands for ^ are positive semi- 
deflnite. Therefore, ^ is at least positive 
semi-deflnite. It will be assumed in what 
follows that is positive definite and, 

therefore, nonsin^ar. Solving Eq. 8-185 for 
7 then yields 


y = [27o(C. ^ . ) 

(8-190) 


Substituting 7 from Eq. 8-190 into Eq. 
8-180, Eqs. 8-183, and 8-184 yield 

«u(x) = -^5u'(x) + 6m*(x) (8-191) 

27o 


6v(x) = —6v*(x) + 6v^(x) 

27o 


(8-192) 


and 

bb = 8b^ + db^ (8-193) 

27o 

where 

X (a-' - M^^,xesi (8-194) 

6«*(ac) = li';* A*"' 

\ du du “ J 

+ C3^*u;>] (8-195) 

^.xesi 


6v>(x)=H';' 

X (n-'-n*iw-; M^'^,xer (8-196) 

6v*(x) = »';' 

" \ dv dv ” / 

X [n*i»f-;(c,^ + C3i»f^- 

+ (8-J97) 

-CaV-^-^A-"' cS.xer 
^ dv 
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5fc> = H'j ‘ (i^ - i* (8-198) 

and 


5ft* = w;' i* M~\+ 

+ C^M^.y (8-199) 


00 . 

4. —— bu^ = — C 20 , in ^2 
du 


d0 

5 . = 0 ,in ^2 

bu 


bCi - 

6 . T— 6 v^ = - C 3 W, on r 
dv 


It should be noted that if there were no 
constraints then du, dv, and db would reduce 

to - A-', - n-', and - 

2 yo 270 270 

respectively. In order that the change in 
design variables and parameters should be in 
the negative gradient direction, it is clear that 
7 o > 0 is required. The magnitude of 7 © 
could be determined by substituting Eqs. 
8-191 through 8-199 into Eq. 8-171. How¬ 
ever, dP must be chosen so it may be just as 
well to simply choose 7 © in Eqs. 8-191, 8-192 
and 8-193. 


7 . = 0 ,onr 
3v 

8. -fi-'^Sft* - Jn-'^ftv'dT 

r 

= SS < 0 

n 

A computational algorithm may now be 
stated based on this development and the 
arguments presented in par. 8 - 2 . 

Algorithm 


Just as in the problems of pars. 8-2 and 8-3, 
the variations 6 m‘( x), 6 m^{x), 6 v‘W, 6 v*(x), 
66 *, and db^ satisfy Theorem, 8-2. 

Theorem 8-2: The above variations satisfy 
the identities 

1 . 5ft‘^H'.5ft*+ / 5v‘»',6v*dr 

’’ r ” 

+ //6u‘H'„6M*dn = 0 

2 . £* ^6ft* + / n* ^5v*dr 

r 

+ SS A*^du^da = -Ci^ 

SI 

3 . 

r 

+ //A'^^6M‘dn = 0 

SI 


Step 1. Make an engineering estimate 

of the opti¬ 
mum design functions and param¬ 
eter. 

Step 2. Solve Eqs. 8-136 and 8-137 for 
corresponding to 
and 

Step 3. Check constraints and form 0, 
and cj of Eqs. 8-158, 8-163, and 
8-164. 

Step 4. Solve the differential equation, Eq. 
8-150, subject to the boundary 
conditions generated by Eq. 8-152 
with g and F given by Eqs. 8-154 
and 8-155, to obtain and 
respectively. 

Step 5. Compute A'^(jc), A^ (x). 
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n^(x), and in Eqs. 8-153 and 
8-161. 

Step 6 . Choose the correction factors Q, 
C 2 , and C 3 in Eqs. 8-160, 8-167, 
and 8-168. 

Step?. Compute and 

^ in Eqs. 8-186 through 8-189. 

Step 8 . Choose 7o > 0 and compute 7, 
nix), and v{x) in Eqs. 8-190, 
8-181, and 8-182. If any compo¬ 
nents of 7 with Of > r\ fi(x) with 1 
> or v{x) with / > f' are 
negative, re-deflne rp, ^x), and 
ci(x) by deleting corresponding 
terms and return to Step 5. 

Step9. Compute 6 w‘(x), 6 m^(x), 

6v^(x), 6 b‘, and 6 b^ in Eqs. 8-194 
I through 8-199. 

Step 10. Compute 

U<'>(jt) = tt<®>(jt) (;c) 

270 

+ 

v<‘>(jc) = v<®>(x)- — Sv* (x) 

270 

+ 

fc(l) =ft(0)-Lgftl + 5 ft 2 . 

270 

Step 11. If the constraints are satisfied and 
6 m‘(J f), 6 v‘(->f), and 6 b‘ are suf¬ 
ficiently small, terminate. Other¬ 
wise, return to Step 2 with 
v^®^(x), and b^®^ re¬ 
placed by vO>(x), and 

bO), respectively. 

I 


As in par. 8-2, if after several iterations the 
constraints are all satisfied, 6m^(x), 6v^(x), 
and 6 b^ will all be zero. In this case, 

= < 8 - 200 ) 

where 

\ dv dv ”/ 

+ If 

n “ 

Just as in par. 8-2, one can now specify a 
reasonable, desired reduction 6 / in /. The 
formulation, Eq. 8-200, provides a means of 
finding 70 that, based on the preceding linear 
approximations, will yield the desired reduc¬ 
tion in the cost function. 

8-5 OPTIMAL DESIGN OF AN ARTIL¬ 
LERY RECOIL MECHANISM* 

An artillery weapon mounted on tires or 
tracks has some undesirable features. Unlike 
the hard mount (weapon rests on a base 
plate), the flexible mount will have a pitch 
motion. During the recoil stroke, when the 
weapon is fired at 75-deg elevation, the tires 
load up or compress; and when counterrecoil 
begins, the tires act like a spring and unload 
sending the tires off the ground. It is quite 
obvious that, when the weapon comes to rest, 
the likelihood of it being zeroed in for the 


*The results of this paragnph represent the work of Mr. 

T. D. Streeter, Ref. 15. 
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next round has been reduced considerably, 
especially for high rate of fire weapons. This 
phenomenon is known as a secondary recoil 
effect, because an additional acceleration 
term enters into the recoil equations. Because 
of this secondary recoil effect, the recoil 
mechanism design becomes much more dif¬ 
ficult. For short recoil, the orifice areas in the 
recoil mechanism are designed at maximum 
elevation (75 deg). Therefore, when elevation 
is mentioned throughout the remainder of 
this report, it refers to maximum elevation. 
The weapon positioned for high-angle fire is 
shown in Fig. 8-1. 

The purpose of this paragraph will be to 
develop a systematic recoil mechanism design 
procedure characterized by mathematical 

8-36 


modeling for the high-speed digital computer. 
To do this, the steepest descent, numerical 
technique is used to minimize the hop or 
pitch motion of the weapon and, at the same 
time, to determine the necessary control 
rod * design that will minimize hop. 

The recoil equation for a rigid mount is of 
the form 

i+/(x)x^ +g(x) = A(r) (8-201) 

where x is the displacement of the recoiling 
parts, g(x) is a restoring force, and h(t) is the 


*The control rod in a hydraulic recoil mechanism is a rod of 
variable cross section which moves through a larger orifice 
during recoil and varies the area of the orifice to control 
recoil force level 
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breech force due to firing. In the second term 
of this equation, the expression for the effect 
of the control rod orifice areas can also be 
obtained from a predetermined recoil force*, 
Rit). For the flexible mount, Eq. 8-201 is 
coupled with the equation describing the 
pitch motion of the weapon and thus yielding 
two second-order nonlinear ordinary differen¬ 
tial equations with prescribed initial condi¬ 
tions. R(t) will be taken as the control 
variable which is to be determined to mini¬ 
mize hop (the pitch motion of the weapon) 
subject to other design constraints. The 
orifice area is then determined to provide this 
recoil force. 


This study was performed on a develop¬ 
mental weapon, namely, the XM164. The 
XM164 is a lightweight, split-trailed towed 
105 mm howitzer with the XM44 hydropneu¬ 
matic recoil mechanism. Unlike a rigid mount, 
the XM164 is flexible and is fired while 
resting on rubber tires. 


For a rigid mount weapon, the resisting 
force R(t) on the recoiling parts is designed 
with a trapezoidal shape as shown in Fig. 8-2. 
With the proper design of the control rod 
orifice area, the flow of oil in the recoil 
mechanism is controlled and such a force, as 
shown in Fig. 8-2, can be obtained. However, 
when a force (shaped as in Fig. 8-2) is 
designed for the flexible mount, the question 
is asked, “Can this force be applied with some 
other ‘best* shape, such that it will reduce the 
pitch of the weapon?” This is the basic 
question with which this design problem is 
concerned. 


*The recofl force is the retarding force on the rearward 
traveling barrel during recoil, due to throttling of oil 
through the variable area ortfice. 



Figure 8-Z Recoil Force for a 
Rigid Mount 


8-6.1 FORMULATION OF THE PROBLEM 

During the recoil, counterrecoil cycle, there 
are four different times which are of concern. 
These are shown in Fig. 8-3. 



Figure 8-3. Time Intervals 


In Fig. 8-3, the special times noted are: 


f® = firing of round 
f ‘ = end of the recoil stroke 

= time at which maximum hop occurs 
= end of counterrecoil 

At these four times certain conditions must 
be satisfied from the design requirements. At 
time f® the initial conditions for the state of 
the system are given. At time f * the displace¬ 
ment of the recoiling parts is required to be 
equal to some specified value and the velocity 
of the recoiling parts must be equal to zero. 

8-37 


Digitized by i^ooQle 



AMCP 706*192 


At time the velocity of the pitch motion 
must be zero (necessary condition for maxi¬ 
mum pitch), and the displacement of the 
pitch motion is to be a minimum. Note that it 
will be possible for to vary between and 
Therefore, the hop or pitch motion will be 
minimized for the entire counterrecoil stroke. 
At the final time which is the end of 
counterrecoil, the recoiling parts must return 
to their original position, and the velocity of 
the recoiling parts will be some specified value 
F 3 . This is to insure that the recoiling parts 
return to the latch position. It will also be 
demanded that the total cycle time be equal 
to Cj seconds. 

Formulating this problem into the mathe¬ 
matical notation of par. 8-3 yields 

Minimize / = ) = g© (8-202) 


subject to the equality constraints: 
'ill =Jc,(r')-no + Vx=«> “0 


ii% =JCjO®)-no =«j =0 

03 - V3 =gi =0 

£2' =JCi(/') = 0 


> (8-203) 


n* =X3(r*) = 0 


n® = - Cj. = 0 


with the full set of initial conditions 


Xl( 0 ) = X 3 ( 0 )=X 4 ( 0 ) = 0 , 


•^ 2 ( 0 ) = t?O 


I (8-204) 


where 1 = 1,2,3 are intermediate and 
terminal constraint functions to be satisfied; 
^ 2 *, ^ 2 ^, and ^ 2 ^ define the times at which the 
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intermediate and terminal constraint func¬ 
tions occur; Xi and are the velocities of 
the recoiling parts and pitch motion, respec¬ 
tively; X 2 and X 4 are the displacements of the 
recoiling parts and pitch motion, respectively; 
\pi = 0 is the constraint on the displacement 
of the recoiling parts such that at the end of 
the recoil stroke the displacement wiU be 
exactly equal to inches. = 0 is the 
constraint demanding that the recoiling parts 
return to the latch position at the end of 
counterrecoil. ^3 = 0 is the constraint which 
requires that the velocity of the recoiling 
parts come into the latch position at a 
velocity F 3 inches per second. 12 * = 0 defines 
the time at which the end of the recoil occurs; 
^ 2 ^ = 0 defines the times at which the pitch 
velocity is zero and the one with the largest 
displacement is selected, thus defining the 
time at which maximum hop occurs; and ^ 2 ^ 
= 0 defines the total cycle time to be exactly 
equal to Cj. seconds. 

It was previously mentioned that the rod 
force was taken as the design (control) vari¬ 
able, instead of the orifice areas. Using the 
rod force as the design variable simplifies the 
problem, and it also gives the engineer more 
insight into the design process since he has an 
intuitive feel for the force levels the weapon 
system he is designing can tolerate. Thus, 
immediately the engineer can specify an 
admissible upper limit for the recoil force, say 
^max’ ^ design, and this value may be 
varied by the engineer for any redesign. The 
following inequality constraint, therefore, 
must hold for all time /. 

= 0 (8-205) 

The optimization problem has now been 
formulated. All that must be done now is to 
put the problem into the steepest descent 
formulation. Par. 8-5.2 simplifies the equa¬ 
tions of motion for the XM164 Howitzer. 
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8^.2 EQUATIONS OF MOTION FOR THE 
XM164 HOWITZER 

A schematic diagram showing the moving 
parts and variables defining the dynamic 
model of the XM164 Howitzer is shown in 
Fig. 8-4. In explanation of this physical 
idealization, the following variables are de¬ 
fined: 

= weight of recoiling parts 

= weight of elevating parts less 
recoiling parts 

= weight of nonelevating parts 


AMCP 706-192 

t? = recoil displacement 

f = distance from center line of trun¬ 
nion to center line of recoiling 
parts (vertical) 

= distance from center line of 
spade to center line of trunnion 
(horizontal) 

7 = angle of elevation of gun tube 

= distance from center line of 
spade to center line of trunnion 
(vertical) 


= mass of recoiling parts 


= pitch angle of weapon 
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R{t) = rod force 

B(t) = breech force 

g = acceleration due to gravity 

M = coefficient of friction 

= friction force (guide) 

S 2 = friction force (guide) 

= mass of elevating parts less recoil¬ 
ing parts 

= distance from center line of trun¬ 
nion to mass center of (hori¬ 
zontal) 

ij, = distance from center line of trun¬ 
nion to mass center of (verti¬ 
cal) 

= mass of nonelevating parts 

= distance from center line of 

spade to mass center of W. 

a 

(horizontal) 

= distance from center line of 
spade to mass center of 

(vertical) 

4 = traverse moment of inertia of W 

about its own CG 

4 = traverse moment of inertia of 

about its own CG 

4 = traverse moment of inertia of 

about its own CG 

f 1 = distance from center line of trun¬ 

nion to center line of B(t) 

8-40 


application 
= Static value of 0 

c = damping coefficient 

^ = spring constant of tire 

Qi = distance from center line of trun¬ 
nion to rear of cradle (horizon¬ 
tal) 

^2 = distance from center line of trun¬ 

nion to front of cradle (horizon¬ 
tal) 

ii = distance from center line of trun¬ 
nion to R(t) application (verti¬ 
cal) 

Of = distance from center line of trun¬ 
nion to bottom of rail (vertical) 

P = distance from center line of trun¬ 
nion to top of rail (vertical) 

B = distance from tires to trail spades 

y»Z = axes fixed in trunnion, parallel to 
X-, y-axes 

y»z = axes fixed in carriage 

H,Z = axes fixed in cradle 


The differential equations to be solved are, 
Ref. 16: 

“ (f - yt + z^ cos7)0 
- (^ + >'^ COS7 + sin7)0^ ] 

= R{t) - Bit) - M^g sin (7 + 0 ) 
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- #i(l 5i I + I S 2 I) sgn (ij) (8-206) 
COS 7 + Z, siiry)* +^jl(»?i cosy 
+ >',-?* sin 7 )* +(i?j, sin 7 + z, 

+ fftCOST)»]+i»f^0'i+z5) + /.+4 
+ /j I 0 + 23f^i}«(tj + y, cosy + z, siiry) 

- 0* (f? + j', COS 7 + z, sin 7 ) 

X (f - y, sin 7 + z, COS 7 ) = B(t) • (f, - f) 

+ [^(0-M(I5, 1 + 15, |)sgn(tj)l 
X (f - j', sin 7 + z, COS 7 ) 

- 5 I ^«(»? + J', COS7 + z, sin7) 

X cos (7 + +^^Cyj cos^ - Zj sin 0 ) 

+ Cv, COS 0 - z, sin^ + cos (7 + 

- f* sin(7 + 0)1 I - *(0 + 0,,) - c0 

(8-207) 

12 n 0 +(V+y, COS 7 + z, sin 7)0 

- (f + z, COS 7 - y, sin 7 )^* ] 

= 5, + 5, - cos (7 + 0) (8-208) 

—t?) +5,(^1 — f?) 

-Bit) '(i-io+RU) -(f-fz) 
-#/ll5, l(f-a) 

+ 15, |(f-«] sgn( 7 j). (8-209) 

Eqs. 8-206 and 8-207 are the translational 
and rotational equations of motion, respec¬ 


tively, for the XM164 Howitzer. Eqs. 8-208 
and 8-209 determine the guide friction. 

For small 0 the following approximations 
are made: 

sin 0 = 0 

0 * 

COS0 = 1 — -J- 

The cos (7 + 0 ) and the sin (7 + 0 ) then 
become 

, COS 7 

cos (7 + 0 ) = COS 7 — 0 ' —— — 0 sm 7 

sin (7 + 0) = sin 7 — 0* sin 7/2 + 0 COS 7 . 

From these approximations and the follow¬ 
ing definitions, Eqs. 8-206 and 8-207 can be 
simplified. For simplification, define: 

CX)N1 =i»f, 

CX)N2 = -Mgii - y, sin 7 + z, COS 7 ) 

CONS = - # 1 ( 1 5, I + 15, I) sgn (q) 

CX)N4 = - y, COS 7 + z, sin 7 

CONS =(»#*[ iVb COS7 +y, - sin7)* 

+ (tJj sin 7 + z, + f* cos7)*J 

CON 6 = 5 -y, sin 7 + z, COS 7 
CON7 = f, -f 
CX)N 8 = sin 7 
CX)N9 = C 0 S 7 

CX)N10 = -i»f, •g*CON 8 

a 

CX3N11 = Af^ -g-com/l 

8-41 
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CON12=J»f^ • CX)N4 
CON13 = -2Af, 

O 

CON14 = -2Af,-CON4 
CON15 = - 
CX)N16=Af^-CON6 
CON 17 = Af„ •CON6-CON4 

a 

CON18 = -^*i»/ •CON9 
CON19=g-J»f^-CON9/2 
CON20 = - g-M^ •CON4-CON9 
CON21 =g-i»f^-CON4-CON9/2 

CON 22 = -g-Afj*>>^ 

corns 

CON24 = g'Mj 'z^ 

CON25 = —g-Mf^ -y^ 

CON26 = g’Mf^ ’y,l2 

COmi^g-M^'z, 

CON28 = -g-M^'ft^'COW 

CON29 = g'M^ -ijj •CON9/2 

CON30 = «-Afj-fj*CON8 

CON31 = - g'M^ -fj •CON8/2 

CON32 •CON8 

CON33 •CON4'CON8 

CON34=J»fj*^-i7j-CON8 

8-42 


CON35=i»fj-^*fj-CON9 

CON36 = CON20 + CON22 + CON25 
+ CON28 + CON30 

CON37 = CON21 + CON23 + CON26 
+ CON29 + CON31 

CON38 = CON24 + CON27 + CON33 
+ CON34 + CON35 - k 

CON39 =CON15+CON36 

CON40 = -J»f^-g-CON9 

With these definitions, Eqs. 8-206 and 
8-207 may now be written as 

CONl-ti +CON2*«= 

+ CON3 + CON10 
+ CONn*^* 

+ CONI 2-^* 

+ «*CON40 

(8-210) 

+ CON4)* + CON5]<> = 
CON13-^-^-t? + CON14-n« 

+ CON38-« + CON32-f?^ 

+ CON39 -c^+ CON16-^*f? 

+ CON17*^* +5(r)-CON7 
+ [/?(/)+ CON3J*CX)N6 + CX)Nl 8-f? 

+ CX)N19-fj-^* +CON37'^» 

( 8 - 211 ) 
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Eqs. 8-210 and 8-211 
following form 

>’iin +v,,« =v ,3 

= Vjj 


can be put into the 


( 8 - 212 ) 


where 


V,, = CONI 


X, 

Xj =7? 
■*3=0 
JC4 =0 


(8-214) 


When this is accomplished, the following 
first-order equations yield the proper formula¬ 
tion that will be used in the steepest-descent 
scheme: 


V,, =CON2 


v,3 =/?(/) -5(0 +CON3 +CON 10 

+ CX)Nll-0* +i»(,t}0* 

+ CONI 2*0** + CON4O*0 

V,, =0 

V2 j = (n + CON4)» + CONS 
j>jj = CON13 *i707J + CONM’t}^ 


JCl =(>'l3*»'ji - \ 


/(«'ir»'jj)=/i 


Xi =Xi =f2 
JCs = V23/V32 =/3 
*4 =-*3 =/♦ 


> (8-215) 


/ 


8-5.3 STEEPEST DESCENT FORMULA¬ 
TION 


+ CON38*0 + CON32*n0 
+ CON39 - + CX)N16-^*ti 

+ CONI 7-^* +5(0*CON7 
+ (/?(0 + CON3)-CON6 
+ CON18-I? + CON19-f?*0* 

+ CON37-0* 


Eq. 8-212 can now be written as 


T? = (Vi 3 •Vjt - Vi 2^2 3)1 1 • 1^22) 


(8-213) 


0 =V 2 3 /V 3 2 - j 

By making the following definitions, Eqs. 
8-213 can be put into first order form: 


The optimal design problem can be stated 
as follows: Determine the design variable 
R(t) in the interval 0 < / < so as to 

minimize J = X 4 (t 2 ) (8-216) 


subject to the constraints 


01 =*3(^‘)-90+n„,x =0 ' 

03 =X2(t^)-Vo =0 


03 =JCl(t^)- 1^3 =0 
i2* =x,(r*) = 0 


> (8-217) 


n* =X3(/*) = 0 


03 =/3 -Cj. =0 


8-43 
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0 < 0 (8-218) 


and satisfying 

X =/ (8-219) 

where the components of the vector / are 
given in Eq. 2-215 and initial conditions are 


Xi(0) = ac3(0)=^4(0) = 0, JC2(0) = r?o - 


The minimization problem stated here 
starts with an estimated design for /?(/), 
analyzes it, and then improves on the design. 
The steepest descent technique of par. 8-3 is 
used here to solve the design problem stated. 
The first step in implementing the computa¬ 
tional algorithm of par. 8-3.3 is computation 
of auxiliary variables required for the al¬ 
gorithm. 


8-6.3.1 DETERMINATION OF THE AD¬ 
JOINT EOUATIONS 

The adjoint equations are, from Eq. 8-93 

where the vectors / and x are deflned in Eqs. 
8-215 and X = (X,,X 2 ,Xj,X 4 )^. 



8-44 


'[-(If)-(lr)]) 

/(•'ll via) /=1.2,3,4 


3/i 

=-v,2jt3(CON13*ac, +CX)N14) 

/(•'llVjj) 

Vi ( 

3 ^ = j >',,Vjj[2/I/^v,3(jc2 +CON4) 


+ J^a^ 22 Xl - V,j (CX)N13-Jf,X3 
+ CON32*ac4 +CX)N16*Jci 
+ CX)N18 + CON19-;cJ)J 


Xl2^^^v^^(X2 + CON4) J j /(V? ,vi2 ) 
9/i 

3^ = Iv22(23f^X2X3 + 2-OON12X3) 

- v ,2 (CON13*x,jr2 +CON14-X, 
-C+2*C0N16-X3X2 
+ 2-CONl 7*X3) J /(v,, vj J ) 


9/i 

^ = [v2 2(2*CON11*jc4 +CX)N40) 
0 x 4 

-v ,2 (C0N38 + C0N32'X2 
+ 2*CX)N19-X2X4 
+ 2*CON37-ac4))/(v,, V 22 ) 
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bxi 

dxj 


dxj 


3X4 


3x, 


<=1,2,3,4 

■xj (CON13«Xi C0N14)/«»22 

^ [V2 2 (CON13«XiX3 ■^CON32•X4 
+ CON16-x| -K^NIS 
■K:ON19*x3) 

-2M^V2 3(X2+CON4)]/v}2 

= (CON13-x,X 2 + CX)N14«x, - C 

+ 2-C0N16*X3X2 + 2 CON17-X3)/v22 

^ (CON38 + CON32-X2 + 2-CX)N19 

•X 2 X 4 + 2’CON37*X4)/v2i 

:0^ = 0^=1 = 0 
* * Av * Av 

0 X 2 0 X 3 0 X 4 


The adjoint equations now become 

0 


X = - 


3/i 

1 

Ml 

3x, 

3x, 

a/i 

n 

Ml. 

3X2 

V/ 

3X2 

3/i 

n 

Ml 

^X2 

u 

3 x 3 

3/i 

n 

Ml 

dX 4 

u 

3X4 


X (8-220) 


a-5.3.2 DETERMINATION OF THE 

BOUNDARY CONDITIONS FOR 
THE ADJOINT EOUATIONS 

Because of the intermediate constraint 
functions, we must evaluate X at and/‘“to 

allow for any discontinuities which may occur 
across and Since the initial conditions 
for the adjoint equations are given at these 
equations are integrated backwards on the 
time interval shown in Fig. 8-5. Integration is 
carried out by integrating from to 
Application of Eq. 8-116 provides new initial 


- 4 +-4- 


+ 


3 Time 


r r 

Figure 8-5. Recoil Time Interval 

conditions at Integration is then per¬ 
formed from to Likewise, using new 
initial conditions at integration is then 
performed to 

It is the object of this paragraph to 
determine the initial conditions at /^",and 
for the four different integrations per¬ 
formed on the adjoint equations, i.e., for ^ 1 , 
^ 2 . ^ 3 »and/. 

Since of Eq. 8-203 does not depend 
explicitly on x. hil^jhx = 0 and Eq. 8-114 
reduces to 

Thus, 

X-'a*) = ( 0 , 0 , 0 ,of 


where the partial derivatives are as computed 
in this paragraph. 


X**(/*) = ( 0 , 0 , 0 , 0 f 
x**(r") = ( 0 ,i, 0 , 0 f 
X**(<*) = (i,o,o,of 


( 8 - 221 ) 


8-45 
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Integration of Eq. 8-220 backward from 
to + 0) can now be effected, using the 
initial conditions of Eq. 8-221. Since Eq. 
8-220 is homogeneous in X and the initial 
conditions on and » are zero, it is clear 

that + 0) = + 0) = 0. While 

X'*'’ (t* + 0) and X'*’’ (t* + 0) will not be zero, 
they may be treated as known. There is now 
adequate information to use Eq. 8-116 to 
determine X(/^ — 0) for all four adjoint 
variables. 

In order to find X(/^ - 0) in Eq. 8-116, 
choose the minus sign alternative throughout 
and obtain 


Thus, Eq. 8-222 yields 


- 0 ) = 



- 0 ) = 


0 

0 

0 

0 _ 


- 0 ) = x*>(r’ + 0 ) 



- 0 ) 


(• 

0 

I 

0 


> (8-223) 


and 


X(r» -0) = X(r»+0) + (|j^) 


X **(/2 _ 0 ) = X **(/* + 0 ) 



(r* 


/,(/* - 0 ) 


0 

0 

0_. 


1 / 


+ X^(r* + 0) lf(t^ - 0) 


-/(t*+0)J 



( 8 - 222 ) 


It may be noted that X*' (r' + 0) = 0 will 
result from integration of the homogeneous 
Eq. 8-220. 


Finally, at Eqs. 8-107 and 8-109 yield 


Using Eqs. 8-107 and 8-109 to determine 
g(/^-0) and n(/^-0), one obtains at - 0 


iV -0)=/4(r* -0) 


-0) = o 


-0) = o 

_ 0 )=/a(r' - 0 ) 

-0) = o 
-0) = o 


- 0 ) = 0 


and 


i 3 (r* - 0 ) = 0 


n>(r> -0)=/,(r' -0) 


and 

- 0 )=/ 3 (/^ - 0 ). 


Replacing with r*, and with x* in Eq. 
8-222 provides the proper jump conditions at 
. They are 
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x*'(/' -0)*x*'(r‘ *►0) 


_ X^^r* ■»-0)l/(r‘ -0)-/(/» -»-0)l 


"o“ 


"r 

1 

r ^ 1 

0 

0 

L/i('‘ -o)J 

0 





X*'(/' -0)- 


X**(r' -0)-X*‘(r' +0) 

X*‘^(/' +0)l/(/' -0)-/(r' +0)1 
/,(/' - 0 ) 

X**(/' -0) = X*’U' +0) 

X*»^(r' +0)l/(f' -0)-/(f' +0)1 
/,(/• - 0 ) 


>(8-224) 


Eq. 8-220 may now be integrated from 
to r®, with jumps at and defined by Eqs. 
8-223 and 8-224. This completes computation 
required by Step 5 of the algorithm of par. 
8-3.3. 

8-5.3.3 COMPUTATION OF DESIGN IM¬ 
PROVEMENTS 

The remaining steps of the computational 
algorithm of par. 8-3.3 require only routine 
calculation. Some of the key formulas are 
highli^ted here to illustrate use of the 
algorithm. In Step 6, the following calcula¬ 
tions are effected: 


> 

II 

1 

/= 1,2,3, 


if 0 > 0 

A*(r)={ 


1 0 , 

if 0 < 0. 


where (t) is a weighting factor, set equal 
to one in this example. 

With these factors defined, one must 
choose the magnitude of constraint error 
correction to be used, in Step 7. In the 
current problem, reasonable design estimates 
led to small errors, so a == c == 1 was chosen. 

For Step 8, only the following routine 
numerical integrations are required: 


M 




where 


■( 


A* 


d(t) = 



= I A* d(t)\* dt 


M 




. (' A*" 


^dt. 


Eq. 8-80 was used to find 7o so that a ten 
percent reduction in cost function is sought. 
From Eq. 8-43 (since a = c = 1) 

7 = 

and from Eq. 8-35 

nit) = - A*(0-' 1(1 - diOy.A' + A* 7) 

- 27o0] • 

At any points where /i(/) < 0, delete this 
point from the domain of ^t) and return to 
Step 6. 
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FinaJJy, the improved design is provided by 
Step 11 of the algorithm, where entries arc 
computed directly from Eqs. 8-44 and 8-45. 
While these formulas arc a bit messy, they arc 
routinely programmed matrix computations 
which are no real challenge to the computer. 

SSA RESULTS AND CONCLUSIONS 

Fig. 8-6 represents the optimal recoil force 
to minimize hop at 75-dcg quadrant elevation 


important factor in reducing the hop; Le., the 
faster the recoiling parts acederate during this 
period, the greater the reduction in hop. As 
one would expect, an increase in allowable 
recoil length also reduces hop significantly. 
An increase in the maximum rod force will 
also reduce hop, for example, if the recoil 
force is allowed to obtain the value 24160 lb 
in constraint set of Eq. 8-225, the hop can be 
reduced an additional 0.32 iiL Fig. 8-7 shows 



i I 1 i I 1 » 1 1 1 » » » « 

0 10 20 30 40 50 60 708090 100110120130 140 


Time, msec 

Figure 8-6. Optima! Rod Force 


with the following constraints: 


R{t)< 220001b 
recoil length = 28 in. 


(8-225) 


The resulting hop for this case is 1.53 in., i.e., 
the tires leave the ground 1.53 in. for a 115% 
maximum rated pressure breech force. If the 
constraints were relaxed, such that 


R(t)< 235001b 
recoil length = 29 in. 


(8-226) 


the resulting minimal hop is 0.88 in. 


The acceleration of the recoiling parts 
during the first portion of counterrecoil is an 
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XM164 Control Rod Dei^ CompariKyn Between 
l^eaent Groove* and Optimum Groove* for Minimum 
HOP at 75>deg Quadrant Ekvatiofli and 115% Maximum 
F^eature Breech Force 
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Figure S-7. Optima! Control Rod Design 


a possible variable oriHce area design for short 
recoil. The orifice areas were obtained from 
the recoil force in Fig. 8-6. The resulting force 
levels from the new groove design are indicated 
by the dotted lines from 0.110 sec to 0.13 
sec. Fig. 8-6. The recoil force is the same as 
the optimal shaped force curve from 0 to 
O.llO sec. The increase in hop is approxi¬ 
mately 0.1 in. The recoil length changed a 
very small amount. 

An interesting side point is that of the 
speed of convergence. The nominal design 
variable R{t) used for the first iteration was 
such that at the end of counterrecoil the 
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recoiling parts were 250 in. away from the 
latch position and the required fmal velocity 
of 6 in./sec, was 96 in./sec. In approximately 
14 iterations, convergence was obtained 
which seems to be very fast if one considers 
the complexity of the equations involved. 

Results from firing tests show a significant 
reduction (50% or more) in hop can be 
achieved simply by increasing the tire pres¬ 
sure. Because tire performance information is 
not presently available, it was assumed 
throughout this analysis that the spring rate 
of the tires was constant (linear spring). 
Therefore, it is not known what results would 
be obtained under a nonlinear spring model. 
Tire manufacturers are investigating methods 


to optimize tire characteristics for the final 
configuration in the tire itself. In order to 
obtain optimum weapon performance for 
flexible mount systems, such information as 
tire performance could be incorporated into 
the mathematical model and perhaps tire 
characteristics could also be optimized in the 
environment for which they are being used. 

The technique used here has the capability 
to optimize many design parameters simulta¬ 
neously. If there exist other sensitive param¬ 
eters, consideration should be given to opti¬ 
mize them along with the design variable Rit). 
This study clearly indicates that weapon 
performance can be improved by using 
methods of optimal design. 
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CHAPTER 9 

APPLICATION OF STEEPEST DESCENT METHODS TO 
OPTIMAL STRUCTURAL DESIGN 


9-1 INTRODUCTION 

The same general class of optimal structural 
design problems considered in Chapter 5 is 
treated in this chapter by using the theoretical 
results and computational algorithms de¬ 
veloped in Chapter 8. For a discussion of the 
essential elements of the design problem to be 
considered, the reader is referred to par. 5-2. 

The basic difference between problems 
treated in this chapter and those treated in 
Chapter 5 is in the nature of the design 
variables and the associated state variables 
that describe the response of the structure. In 
this chapter, distribution of the material along 
members of the structure will be continuous 
as opposed to discrete, as treated in Chapter 
5. Consequently, continuous variation of 
stress, displacement, and eigenfunctions along 
members of the structure will need to be 
determined. In this sense, the present problem 
is infinite dimensional whereas the problem 
treated in Chapter 5 was finite dimensional. 
The optimal structural design problem, there¬ 
fore, will involve boundary value problems as 
opposed to algebra problems. 

The continuous design problem treated in 
this chapter will have more features to ac¬ 
count for than did the discrete problem. For 
example, there will be differential equations, 
boundary conditions, pointwise inequality 
constraints, and functional constraints. 
Further, nonclassical analysis problems may 
arise which require special techniques for each 


particular problem. For this reason, it is more 
difficult to give a general formulation of the 
problem into which every structural design 
problem will fit. 

Examples of difficulties that may occur in 
particular problems include the dependence 
of boundary conditions on design parameters 
and design variables, inequality constraints for 
stress which involve state and design variables 
that must be transformed to functional con¬ 
straints, and interrelationships between eigen¬ 
values and state variables at particular points 
in the structure. Each of these peculiar 
features will be treated as it arises in a 
particular problem. This requires that the 
designer who is using continuous optimization 
methods for structural design must under¬ 
stand the origin of the methods well enough 
so that he can alter the computational al¬ 
gorithms as required to fit his particular 
problems. 

While the previous discussion might indi¬ 
cate that one encounters only difficulties 
when using continuous methods as opposed 
to discrete methods, there appears to be a 
potential for more efHcient computational 
methods in the infinite dimensional problem 
than in the finite dimensional formulation. 
Further, it is clear that the infinite dimen¬ 
sional formulation can yield a true optimum 
while the discrete formulation of the problem 
will generally yield only an approximate 
optimum. In the following paragraph, a rela¬ 
tively general formulation of the infinite 
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dimensional optimal structural design prob- 
lem is given, and the computational algorithm 
based on theory of Chapter 8 is developed. In 
following paragraphs, this theory and com¬ 
putational algorithm are applied to example 
problems. Alterations in the general theory 
are made as they are required in the solution 
of individual problems. 

9-2 STEEPEST DESCENT METHOD FOR 
OPTIMAL STRUCTURAL DESIGN 

In the structural design problems treated 
here, the design is to be specified by a vector 
design function u{x) = [wi(x), .... 
and a vector design parameter b = [hi, .... 
bp]^, where the independent variable x may 
be a real variable, a two dimensional variable 
X = [Xi, X 2 J or a three dimensional variable 
X = [xi. X 2 , Xal^, depending on whether 
material is to be distributed over a line, a 
surface, or a volume. In addition to the design 
variables w(x) and b, there will be state 
variables z(x) = [ri(x), .... z^(x)]^ represent¬ 
ing stress and displacement under load and 
y(x) = [yi(jc), representing mode 

shapes for vibration or buckling. 

For the purposes of convenience in nota¬ 
tion and generality, the system equations will 
be written in operator notation similar to that 
used in par. 8-4. Only linear behavior will be 
considered so that stress and deflection are 
determined by the linear boundary-value 
problem 

I(M.b)z = Q(M.b), xG^2 (9-1) 

and 

Bz^q, xer (9-2) 

In this notation, ^2 is the region over which 
the material of the structure is distributed and 


r is the boundary of that region. L and B are 
differential operators on and P, respective¬ 
ly. The functions Q and q are generally 
related to loads. 

To better fix the idea of operators, con¬ 
sider the simply supported beam of Fig. 9-1. 



Figure 9-1. Simply Supported Beam 


The boundary value problem in this case is 
simply 



Here, w(x) is a variable which uniquely speci¬ 
fied the beam cross section and determines 
the design, E is Young’s modulus, z(x) is 
deflection, and M{x) is the bending moment 
that is computed from the distributed load 
P(x). 

In this example, n is just the interval (0,1) 
and r consists of the two endpoints x = 0 
and X = 1. The advantage in the notation of 
Eqs. 9-1 and 9-2 is that it is convenient and at 
the same time applied to a large class of 
problems. For an example of a problem in 
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which n is a subset of two dimensional space, 
see par. 9-7. 


In addition to response due to static load, 
it is necessary to treat the response of a 
structure to dynamic loads. One important 
characteristic of a structure, which is classi¬ 
fied as dynamic response, is natural fre¬ 
quency. Another response, which must be 
treated, is buckling. Both buckling loads and 
natural frequencies are determined, in the 
present class of problems, by linear eigenvalue 
problems. Again using operator notation, 
these problems may be written in the form 

K{u,b)y^iM{M.b)y. (9-5) 


Constraints of the form 0 for 

will have to be reduced to functional con¬ 
straints as in Eq. 8 - 6 . 

Beginning with an engineering estimate of 
the design variables u{x) and b, Eqs. 9-3 
through 9-6 may be solved for z(x), y(x), and 
f. A perturbation, ( 6 w, 6 b), in the design leads 
to perturbations in the cost and constraint 
functions 

5/ = -^6ft + —6f+ 52 

hb da ^ J 


~ 6 « + -^ Sb\ dSl 
du db / 


(9-10) 


and 

0 = 0, jc€r (9-6) 

where f is the eigenvalue and y(x) the 
associated eigenfunction or mode shape. The 
operators K and M generally relate to stiffness 
and mass, respectively. In conservative prob¬ 
lems they will be symmetric (Ref. 1) (formal¬ 
ly self-adjoint) but in nonconservative sys¬ 
tems, they will not be symmetric. The more 
general unsymmetric case is treated here. 

The optimal design problem is that of 
minimizing 


50 6 «, xea, /=!.r (9-11) 

' du 

+ i!L 5 « +-^ dSl, 

du db / 

/=l,...,s. (9-12) 

The object, as in preceding work, is to 
eliminate explicit dependencies on 6 z and 6 {‘. 
First, the perturbation equation for 6 z is 


/ = /o(^f) + / / /i {x,z,u,b)dP. (9-7) 

n 

subject to the pointwise constraints 

0,(x,m)<O, i=l,...,r (9-8) 

and the functional constraints 

= f f gf{x,ZM»b)dP < 0 , 

n 

,5 (9-9) 


I(M,b)6z+--— lL(u,b)z]du 
du 

+ -^lL(u.b)z]6b = -^ Su 
db du 

+ xen 

Ob 

Bbz = 0, X on r . 


(9-13) 


In certain problems, the boundary con¬ 
ditions may depend on the design parameter 
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b. In that case, the perturbed boundary 
condition is 

d 

[^(b)z]6b=‘0 

ob 

instead of Eq. 9-13. In this case, methods of 
par. 8-4 must be applied to particular prob¬ 
lems. 


and 







To eliminate 5z, integration of 
/ / X^I6z by parts yields operators I • in 
lAnd B* on P such that 


/ / VhTlbz - 6z^I♦XJd^2 = 0 (9-14) 

n 





bud^ 




for all bz and X satisfying Bbz = 0 and ^*X = 
0 on r. 

Solving 

, xen 

bz 

B*\^ = o, xer 

and 

L*\*/ = ^ , xesi 
dz 

B*x'''/ = o, xer 


(9-16) 


(9-15) 



db 


- lL(u.b)zlj j dn j (9-18) 


It remains only to eliminate explicit depen¬ 
dence on 6f. Under very restrictive hypoth¬ 
eses, Kato (Ref. 2) has obtained a relationship 
among 5{‘, bb, and bu. This relationship is 
derived here formally. It is assumed that 
and by depend continuously on bb and bu 
and further that the following perturbation 
formula holds: 


and substituting into Eqs. 9-10 and 9-12, 
using Eq. 9-13, one obtains 



K(u.b)6y + -^ [K{u.b)y]6u 
bu 

+ 4 : l^(u.b)y]Sb 
00 

= + lM(u,b)y]bu 

bu 

+ f [M(u,b)y]bb + m(u.b)by 

(9-19) 

Just as in Eq. 9-14, integration by parts 
may be used to obtain the operators K* - 
and C* adjoint to XT - fAf and C. These 
operators are defined by the relation 
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// w'^(Ky - Wy)dn = ff y'^(k*w 
n n 

- tMw)dSl 

for all w and y satisfying C*w = 0 and Cy 
= 0 on r. Let y satisfy 


= xen 

C*y = 0, jc€r 


(9-20) 


Premultiplying Eq. 9-19 by y’’’, integrating, 
and rearranging terms yields 

j ffy^Myda\s!; = 

^ n ’ 

/f ly^KSy - iy^Mdy] dSl 

n 

I 


iK(u,b)y] 


- iy^ ^ lM(u.b)y] J Su dn 


be emphasized that Eq. 9-21 is obtained by 
strictly formal calculations. There are deep 
mathematical questions concerning the gen¬ 
eral validity of this result. For a treatment of 
the subject of perturbation theory in linear 
operators, the reader is referred to Ref. 2. 

An expression for in terms of du and db 
may not be substituted into Eqs. 9-17 and 
9-18. Defining 

bb I aft [dft 

T T 

yjjdn 


*1 

du L du 

{M{u,b)y^ dn j «ft 

- umm) ]x' 

(9-21) 

The first term on the right is 

-1 

+ 

S !{y'^Kby-^y^Mhv]dQ. 


n 

r d / \ ^ 

" La;: ^ 

= n{^y^(K*y-m*y)]dD. = 0 

n 

-r(-^ W«.z»3']) y] 

by definition of the adjoint operator. It must 


(9-22) 


(9-23) 
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Define vector constraint functions 0(x) 
containing those constraint functions > 
0 and ^ containing constraint functionals 
> 0. Define 

A*(jr) = j-^ I ^' '*'•**' ^ 

(9-28) 

fi* = I for aU ! with > oj (9-29) 
and 

(x) = I for all/ with > ol . 


(9-24) 


(9-30) 


and 



Defining A0(x) and to be desired 

reduction in constraint error, the linearized 
problem is to choose 8u(x) and db to mini¬ 
mize 

SJ = i^^6b+ f f dSl (9-31) 

n 





(9-25) 

Eqs. 9-17 and 9-18 become 

6 / = + ff A^^hu dSl (9-26) 

n 


subject to 

- A0 < 0, for Jc€^2 (9-32) 

i*^Sb + f f A*^6u dSl-A^^O 
n 

(9-33) 

and 

Sb^W^Sb + f f 6u^W^(x)6u dSl 
n 

- £2 < 0 (9-34) 


and 

T 

= db + fj A^fbu dSl . (9-27) 

n 


The weighting matrices and W^(x) are 
chosen positive definite and £ is small. This 
optimization problem for du and db coincides 
with the problems of pars. 8-2 and 8-3 for 
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ordinary differential equations and with the 
problem of par. 8-4 for partial differential 
equations. 

Determination of a solution proceeds 
exactly as in Chapter 8 so only the resulting 
computational algorithm is given here. 


a 


and 


Steepest Descent Algorithm for Optimal 
Structural Design 


M 




>// A*’'*';' ^ 4*d£! 

n 


Step 1. Make an engineering estimate of 
the solution u^^\x) and 

Step 2. For / = 0,1, ..., solve Eqs. 9-1, 9-2, 
9-5, and 9-6 for z^\ y^\ and f 


Step 7. Choose stepsize 7 o > 0 and evalu¬ 
ate 

7 = -A^i;(27o(A^+Af^^) 


Steps. Check the constraints of Eqs. 9-8 
and 9-9 and form 0 and 0 contain¬ 
ing the constraints not strictly 
satisfied. 


Step 4. Solve the boundary value prob¬ 
lems, Eqs. 9-15 and 9-16, for 
and corresponding to 0^ > 0. 
Solve the eigenvalue problem, Eq. 
9-20, fory. 

Step 5. Choose the corrections in con¬ 
straint errors A0 and A0. 

Step 6. Evaluate 9/, 9^1, and A'^l in 

Eqs. 9-22 through 9-25 and com¬ 
pute 


= [27o 

bu “ 

If any components of y or /i(x) are 
negative, delete the corresponding 
components of 0 and 0, respective¬ 
ly, and return to Step 5. Other¬ 
wise, continue. 

Step 8. Compute 


d0 , 


bu 


n 


X 





a' dSl 






I-''.-) 

X (A-^ - A* 
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dimensional optimal structural design prob¬ 
lem is given, and the computational algorithm 
based on theory of Chapter 8 is developed. In 
following paragraphs, this theory and com¬ 
putational algorithm are applied to example 
problems. Alterations in the general theory 
are made as they are required in the solution 
of individual problems. 


r is the boundary of that region. L and B are 
differential operators on and P, respective¬ 
ly. The functions Q and q are generally 
related to loads. 

To better fix the idea of operators, con¬ 
sider the simply supported beam of Fig. 9-1. 


9-2 STEEPEST DESCENT METHOD FOR 
OPTIMAL STRUCTURAL DESIGN 

In the structural design problems treated 
here, the design is to be specified by a vector 
design function u(x) = [wi(x), .... u^ix)]^ 
and a vector design parameter b = lbi, ..., 
bp]^» where the independent variable x may 
be a real variable, a two dimensional variable 
X = [jci, X 21 or a three dimensional variable 
X = [xi, X 2 , Xal^, depending on whether 
material is to be distributed over a line, a 
surface, or a volume. In addition to the design 
variables w(x) and b, there will be state 
variables z(x) = [zi(x),.... z^(x)J^ represent¬ 
ing stress and displacement under load and 
yM = [yi(x),.... y^(x)]^ representing mode 
shapes for vibration or buckling. 

For the purposes of convenience in nota¬ 
tion and generality, the system equations will 
be written in operator notation similar to that 
used in par. 8-4. Only linear behavior will be 
considered so that stress and deflection are 
determined by the linear boundary-value 
problem 

L(u.b)z^Q(u,bl xen (9-1) 



Figure 9-1, Simply Supported Beam 


The boundary value problem in this case is 
simply 


L{u)z = EI{u) 


d}z 

dx^ 


-Mix) = Q 


(9-3) 


and 



(9-4) 


Here, w(x) is a variable which uniquely speci¬ 
fied the beam cross section and determines 
the design, E is Young’s modulus, z(x) is 
deflection, and Af(x) is the bending moment 
that is computed from the distributed load 
Pix). 


and 

Bz = q, xGP (9-2) 

In this notation, 12 is the region over which 
the material of the structure is distributed and 


In this example, 12 is just the interval <0,1 > 
and r consists of the two endpoints x = 0 
and X = 1. The advantage in the notation of 
Eqs. 9-1 and 9-2 is that it is convenient and at 
the same time applied to a large class of 
problems. For an example of a problem in 
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which is a subset of two dimensional space, 
see par. 9-7. 

In addition to response due to static load, 
it is necessary to treat the response of a 
structure to dynamic loads. One important 
characteristic of a structure, which is classi¬ 
fied as dynamic response, is natural fre¬ 
quency. Another response, which must be 
treated, is buckling. Both buckling loads and 
natural frequencies are determined, in the 
present class of problems, by linear eigenvalue 
problems. Again using operator notation, 
these problems may be written in the form 

K{u, b)y = fAf (M, h)y. (9-5) 

and 

0 = 0, xer (9-6) 


Constraints of the form ri{x,z,u,b)< 0 for xEfl 
will have to be reduced to functional con¬ 
straints as in Eq. 8 - 6 . 

Beginning with an engineering estimate of 
the design variables u{x) and 6 , Eqs. 9-3 
through 9-6 may be solved for z(jc), y(jc), and 
f. A perturbation, (6 m, 66 ), in the design leads 
to perturbations in the cost and constraint 
functions 

+ 4^ 6 m + (9-10) 

hu bb / 

50 =4^ 6 „, x&Sl, i=l.....r (9-11) 
' bu 


where f is the eigenvalue and yix) the 
associated eigenfunction or mode shape. The 
operators K and M generally relate to stiffness 
and mass, respectively. In conservative prob¬ 
lems they will be symmetric (Ref. 1) (formal¬ 
ly self-adjoint) but in nonconservative sys¬ 
tems, they will not be symmetric. The more 
general unsymmetric case is treated here. 

The optimal design problem is that of 
minimizing 



66+-^ 6f + 

/le> 

+A 

. 6m + 6b\ 

da. 

du 

bb ) 


/= 1 , 

..., s . 

(9-12) 


The object, as in preceding work, is to 
eliminate explicit dependencies on 6 z and 
First, the perturbation equation for 6 z is 


+ / / /i ix.z.u.b)dP. (9-7) 

n 

subject to the pointwise constraints 

0 ^(x.m)<O, xep., /=l,...,r ( 9 - 8 ) 

and the functional constraints 

= ^/(h.f) + / / gjix,z,u,b)dP < 0, 
n 

/= 1,...,^ (9-9) 


£(m,6)6z+--— [L(m,6)z]6m 
bu 

b bO 

^ ll(u,b)z]6b = 6u 

bb bu 

bQ 

+ 66 , xSP 

Ob 

B6z = 0, X on r . 


> (9-13) 


In certain problems, the boundary con¬ 
ditions may depend on the design parameter 
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b. In that case, the perturbed boundary 
condition is 

d 

ob 

instead of Eq. 9-13. In this case, methods of 
par. 8-4 must be applied to particular prob¬ 
lems. 


and 


df 


6f + 





To eliminate 6z, integration of 
/ f Lbz d^l by parts yields operators in 
flTand on F such that 


/ / [X^I6z - Sz^L*X]dn = 0 (9-14) 

n 



for all 6z and X satisfying BSz = 0 and ^^X = 
Oon r. 

Solving 


= , xesi 

dz 

B*\-' = o, xer 

id 

L*\*f = ^ , x&a 
dz 

^♦x'’'^ = o, xer 


(9-15) 


(9-16) 


+ \*f 



- [L{u.b)z j dJi j db (9-18) 


It remains only to eliminate explicit depen¬ 
dence on 6f. Under very restrictive hypoth¬ 
eses, Kato (Ref. 2) has obtained a relationship 
among di, db, and du. This relationship is 
derived here formally. It is assumed that 8^ 
and 8y depend continuously on 8b and 8u 
and further that the following perturbation 
formula holds: 


and substituting into Eqs. 9-10 and 9-12, 
using Eq. 9-13, one obtains 



9-4 


K(u,b)dy +-^ lKiu.b)y]6u 
du 

+-^ lK(u.b)y]6b 

= ^M{u.b)y + r [M(u.b)y 1«« 
hu 

+ r4: [M{u.b)y\hb + muMby 
00 

(9-19) 

Just as in Eq. 9-14, integration by parts 
may be used to obtain the operators K* - 
and C* adjoint io K - and C. These 
operators are defined by the relation 
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/ / w^(A> - ^My)dSl = f f y^ik*w 


- iMw)dSl 

for all w and y satisfying C*w = 0 and Cy 
= 0 on r. Let y satisfy 


K*y = ^M*y, xSO 


C*y = 0, xer 


(9-20) 


Premultiplying Eq. 9-19 by y^, integrating, 
and rearranging terms yields 

j //y^Afydnler* 

' n ' 

f j[y'^Kby - iy'^Mby] dSl 




[K(u,b)y] 


- fy ^ ^ [M{u,b)y] I Su dSl 




I «(”4 


- mu.b)y}j dSi j Sb 


(9-21) 


The first term on the right is 


f / (y ^/:6y - (y^MSvl dSl 


= / / iK*y - iM*y)] dSl^O 


by definition of the adjoint operator. It must 


be emphasized that Eq. 9-21 is obtained by 
strictly formal calculations. There are deep 
mathematical questions concerning the gen¬ 
eral validity of this result. For a treatment of 
the subject of perturbation theory in linear 
operators, the reader is referred to Ref. 2. 

An expression for 6f in terms of 6 m and db 
may not be substituted into Eqs. 9-17 and 
9-18. Defining 

db \ db La* 

T 

- \Miu.b)y]j 


(9-22) 


. a// , fag^ 
du L au 

* [■^/// ’"■"H 

J* 

T n 

- r [Af(«,*)>'l) yj 


(9-23) 
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£^=i!L +ff(^+ 

db •'i \ 9b La* 

- (i-'«“•«*') 

T 

-t(-^ («(«,w) 7 ]) 


dSl 

(9-24) 


and 


A*^ 


= v" 

du |_ du 

-(l-tiww) ]x" 

*[^/ 

T 

-f(.|.|«(«,Wyl) 7] 


(9-25) 


Eqs. 9-17 and 9-18 become 


bJ = Si^^bb + / / K^^bu d^l (9-26) 


n 


and 


6^ + f f A ^bu dfl. (9-27) 


Define vector constraint functions 0(x) 
containing those constraint functions 0^(x) > 
0 and ^ containing constraint functionals 
> 0. Define 


A*(x) 


[30,7 

nr 


for all I with 0y(x) > 


0 


(9-28) 


I for all / with > 0 (9-29) 


and 

(x) = * for all/ with > oj . 

(9-30) 

Defining A0(x) and to be desired 

reduction in constraint error, the linearized 
problem is to choose 6w(x) and bb to mini¬ 
mize 

bj^i^^6b+ ff A-'^bu da (9-31) 


subject to 


iT 


bu — A0 < 0, forxEf2 


(9-32) 


e*^Sb + ff A*^6u da-Aip<0 


(9-33) 


and 

db^W^db + ff Su^WJx)Su da 
n 

< 0 


(9-34) 


The weighting matrices and W^ix) are 
chosen positive definite and { is smaU. This 
optimization problem for bu and bb coincides 
with the problems of pars. 8-2 and 8-3 for 
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+ ff 

n 

-1 -i\ * 

Determination of a solution proceeds V ~ du dJT ) ^* 

exactly as in Chapter 8 so only the resulting 
computational algorithm is given here. 


Steepest Descent Algorithm for Optimal 
Structural Design 

Step 1. Make an engineering estimate of 
the solution and 

Step 2. For / = 0,1, ..., solve Eqs. 9-1, 9-2, 
9-5, and 9-6 for z^>. y^\ and 

Step 3. Check the constraints of Eqs. 9-8 
and 9-9 and form 0 and 0 contain¬ 
ing the constraints not strictly 
satisfied. 


M„.fj 

n 

Step 7. Choose stepsize 7© > 0 and evalu¬ 
ate 

and 


Step 4. Solve the boundary value prob¬ 
lems, Eqs. 9-15 and 9-16, for X-' 
and / corresponding to 0^ > 0. 
Solve the eigenvalue problem, Eq. 
9-20, fory. 


Kx) = -Mjl2yo A« 
30 - 


Step 5. Choose the corrections in con¬ 
straint errors A0 and A0. 

Step 6. Evaluate A*^, and A'^l in 
Eqs. 9-22 through 9-25 and com¬ 
pute 


If any components of y or p(x) are 
negative, delete the corresponding 
components of 0 and 0, respective¬ 
ly, and return to Step 5. Other¬ 
wise, continue. 

Step 8. Compute 




dp 


' M “ bu 


n 

’‘('-ir".'.’!-'*'--)'''''" 


1 

00 


du “ / 
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6uHx) = W-^(l- 
du “ / 


bu ** 




6fe' =«'-*(£•'- 


and 

and form 

= ^ 6w’(jc) 

27o 


+ 6w^(x) 


and 

^(/+i) =^(/)-L 6^1 +6^2 

270 


the method of steepest descent, to illustrate 
the direct application of this technique to 
optimal structural design. The mathematical 
formulation of the problem is given in par. 
7-2 and will be used here with a change in 
notation to be consistent with par. 9-2. 

Fixing cross-sectional geometry and aUow- 
ing cross-sectional area u(x) to vary, as in par. 
7-2, yields 

I(x)^au^ix) (9-35) 

The optimization problem is to choose m(x), 0 
< X < L to minimize 

/ u(x)dx (9-36) 

subject to the constraints 

}p=Po-P<0 (9-37) 

0=P/w(jc)-a^,^< 0 (9-38) 


Step 9. If aU constraints are satisfied and 
6M*(Jf) and db^ are sufficiently 
small, terminate. Otherwise, return 
to Step 2. 


and 


d^y 1 

^(u)y ="7T = “^ T— T 
dx^ Eotu} 


y^PM(u)y 


The results of Theorem 8-2 hold for the 
optimal perturbations, and it may be shown 
that a necessary condition for convergence to 
a local optimum is 6 u’(jc) and bb^ approach 
zero. Discussions of Chapter 8 on use of the 
algorithm apply. They will not be repeated 
here. 

9-3 A MINIMUM WEIGHT COLUMN 

A minimum weight column problem has 
been solved in pars. 5-4 and 7-2 to illustrate 
the use of two optimization techniques. The 
same problem is solved in this paragraph, by 


(9-39) 


->(0) - 


"o" 

dy 





_ 0 _ 


where the coordinate system is as shown in 
Fig. 7-1 with y replacing x and x replacing t. 

In its present form, the boundary-value 
problem is just as in Eqs. 9-5 and 9-6 and is 
self-adjoint, so y = y in par. 9-2. For use in 
the steepest-descent algorithm, Eq. 9-23 is 
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a'= \ 


(9-41) 


and Eq. 9-25 is 


\* =- /f^ r y^M 1 

Eotu^ix)/ [£aw^(jc) J 


dx 


(9-42) 

The computational algorithm of par. 9-2 
now applies directly to the present problem. 
The solution of the eigenvalue problem, Eqs. 
9-39 and 9-40, was obtained using the finite 
element analysis technique outlined in par. 
5-4. The solution of this continuous problem 
required approximately the same time per 
iteration as the discrete technique but fewer 
iterations were generally required for con¬ 
vergence. Exactly the same results as given in 
par. 7-2 were obtained. The reader is referred 
to that paragraph for a tabulation of results. 

9-4 A MINIMUM WEIGHT VIBRATING 
BEAM 


Specifically, the problem considered here is 
the determination of the distribution of ma¬ 
terial along the centerline of a simply sup¬ 
ported beam (see Fig. 9-2) so that the beam 
will be as light as possible and still have its 
fundamental frequency at least as large as a 
predetermined frequency o)©- Further, so that 
the beam can support a minimum level of 
bending moment, it is required that the 
second moment of its cross-sectional area 
shall always be at least as large as a positive 
constant Iq . 



L 


Figure 9-2. Simply Supported Vibrating Beam 

As in the column problem of the preceding 
paragraph and par. 7-2, the geometry of the 
cross section is fixed and all dimensions are 
allowed to vary proportionally. If the area is 
denoted u(j\ then 


As was pointed out in par. 7-2, the paper 
by Keller (Ref 11, Chapter 7) in 1960 
presented a mathematically elegant method of 
designing the minimum weight column. The 
same method was applied by Niordson (Ref. 
12, Chapter 7) in 1965 to find the simply 
supported beam of maximum natural fre¬ 
quency for a given volume of material in the 
beam. This method of solution resulted in a 
horribly nonlinear differential equation with 
serious singularities. While a solution was 
obtained for the vibrating beam problem, it is 
doubtful that the method could be extended 
for the solution of multimember structural 
design problems. The methods of Chapter 8, 
on the other hand, are quite general and will 
be used in this paragraph to routinely solve a 
minimum weight beam design problem with 
constraints on natural frequency. 


m^0Lu^(t) (9-43) 

where o is the minimum second moment of a 
cross section with the given geometry and 
unit area. 

Since the material is to be specified with 
constant density, minimum weight is equiva¬ 
lent to minimum volume. The quantity to be 
minimized is, therefore, 

J u(t)dt . (9-44) 

The constraint on /(/) discussed previously 
can now be written as 

<l>^Io-otuHt)< 0 (9-45) 

where /q > 0 is given. 
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The remaining feature of the problem to be 
accounted for is the constraint on natural 
frequency. If the beam with au^(t) = /q has a 
fundamental natural frequency of cuq or 
higher, then this is clearly the optimum beam. 
On the other hand, if this beam has a natural 
frequency below a)o» then there must be 
points along the beam for which au^(t) > Iq 
and a meaningful design problem exists. The 
inequality on natural frequency is 

coq - Cu)< 0 (9-46) 


with boundary conditions 


yi(0) = y,(I) = 0 
>'2(0)=y2(^) = 0 


(9-49) 


The boundary-value problem, Eqs. 9-48 
and 9-49, is self-adjoint so y -y in par. 9-2. 
The optimal design problem is well-defined 
and the notation of par. 9-2 applies directly. 
From Eq. 9-23, 


A-' = 1 


There are several ways in which the natural 
frequency of vibration of a beam may be 
related to the design of the beam u(t). The 
relationship chosen here is the boundary-value 
problem describing lateral displacement 
during oscillation. It is given in Ref. 3 as 

(9-47) 

w(0) = w(i) = 0 
w"(0) = w"{L) = 0 

where prime denotes differentiation with 
respect to jc. 

In order to put the boundary-value prob¬ 
lem, Eq. 9-47, into the form Eqs. 9-1 and 9-2, 
define yi = w, yj = Eotu^id^yi/dx^), and 
po)* = f. The problem, Eq. 9-47, is then 


-1 

_1 

ii 

~ 0~ 

d^Vi 



WJ'l 

Ldx* “ 

Eau^J 



(9-48) 


and from Eq. 9-25, 



The computational steepest-descent algorithm 
may now be implemented in a direct manner. 

As a numerical example, the given problem 
was solved with the data £ = 3x10*^ psi, L = 
10 in., a = 1.0, and p = 0.00208 slug/in.^ The 
eigenvalue problem was solved through use of 
a finite element structural analysis program. 
Even though there was no attempt at making 
the computational routines efficient, only 7 
sec per iteration on an IBM 360-65 Computer 
were required. For most natural frequencies, 
10 to 15 iterations were sufficient for con¬ 
vergence to within numerical accuracy of the 
computations. Results for a range of natural 
frequencies are given in Table 9-1. The general 
shapes of profiles of several of the optimum 
beams are shown in Fig. 9-3 to illustrate the 
optimum distribution of material. 

9-5 A MINIMUM WEIGHT VIBRATING 
FRAME 

The distribution of material along members 
of the frame shown in Fig. 9-4 is to be 
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TABLE 9-1 


COMPARISON OF OPTIMAL BEAMS 


Frequency, 

rad/sec 

Volume of 
Uniform 
Beam of 
Length 

10 in.* 

Optimal 

Volume, 

in? 

Weight 

Reduction 

or 

Material 

Savings, 

% 

3600 

0.9353 

0.8967 

4.13 

4000 

1.1546 

1.0583 

8.34 

4400 

1.3971 

1.2536 

10.27 

4800 

1.6627 

1.4740 

11.35 

5200 

1.9514 

1.7189 

11.92 

5600 

2.2631 

1.9847 

12.30 

6000 

2.5980 

2.2705 

12.61 

10000 

7.2165 

6.3172 

12.46 


* Uniform beam of lowest volume having required natural 
frequency 


determined so that the frame is as lightweight 
as possible and has a fundamental natural 


3,600 rad/sec 





Figures^. Porta! Frame 


frequency greater than or equal to a given coq • 
Further, as a form of strength requirement 
I(x) > /o > 0 is required. 

For convenience, all members have the 
same length and all cross sections have the 
same geometry but may be scaled by a factor 
that varies with jc. In this case, the area of 
cross sections m^(x), i = 1,2,3, uniquely 
determine the design of the beams when the 
beam material is chosen. Further, the second 
moment of the cross-sectional ^reas are 

/,(■«) = a, 

where is a constant depending on the 
cross-sectional geometry chosen. 

Defining 

y\ 

=^2 
=£72 wi' 
ys 
y^ 
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the differential equations for vibration of the 
frame are 



~y'i 



“ij'i 


y'i- 

1 

£7, 


0 


y': 




Ky^ 

y'i- 

1 

EI 2 

= pa)^ 

0 


y'i 



“3>'5 


y'i- 

1 

El3 


0 _ 




(9-50) 


where f = po)^. Boimdary conditions are 


/—s 

0 

II 

0 

yiiL)^-ys(0) 

0 

II 

0 

/,(i)=/3(0) 

0 

II 

0 

m 

/3(^)=/5(0) 

;k3(£) = 0 

>'2(i)=n(0) 

j's(i) = 0 

y^(D=yei.L) 



y'2iL)*y’m = 




>(9-51) 


The boimdary-vahie problem, Eqs. 9-50 
and 9-51, is written in self-adjoint form. The 
last boundary condition in Eq. 9-51 is just 
Newton’s second law applied to horizontal 
motion of Member 2. This boundary con¬ 
dition does not fit Eq. 9-6 exactly due to 
dependence on the design variable U 2 (x), It 
will have to be treated as a special case 
according to the conunent following Eq. 9-13. 


9-12 


The perturbation boundary conditions 
from Eq. 9-51 are 


>(9-52) 


6>'i(0) = 0 (I) = -6^5(0) 

6/i(0) = 0 6/i(I) = 6/3(0) 

6^3(0) = 0 6/3(1) = 6/(0) 

6^3(1) = 0 6^3(1) = 6^4(0) 

dysiD^O by^(L)^by^(L) 
6/(1) = 0 
6/(1)+ 6/(0) = 

- I^f J M,(x)dicj6>',(Z,) 


Two integrations by parts and elimination of 
boimdary terms throu^ use of Eqs. 9-51 and 
9-52 yield 


r by^Ky dx-( y^Kby dx 




(9-53) 


Since the boundary value problem, Eqs. 
9-50 and 9-51, is self-adjoint, y = y in the 
general formulation. The derivation of Eq. 
9-21 holds and Eq. 9-53 may be substituted 
along with 



y^Mby 



hy^My dx 
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to obtain 



This is precisely the form of Eq. 9-27, and the 
remainder of the general derivation of the 
steepest descent algorithm is valid. It should 
be noted that this derivation is formal, and 
rigorous verification of Eq. 9-54 is expected 
to be extremely difficult. 

The algorithm of par. 9-2 was used to solve 
this problem in a direct manner. The eigen¬ 
value problem was solved approximately by a 
finite element method. Data for this problem 
are a = 0.07958, p = 2.616x10^ Ib-secVin.^, 
£ = 10.3x10® psi, Iq = 0.009825 in.®, and L = 
10.0 in. Weights of optimum frames are given 
in Table 9-2 for several frequency require¬ 
ments, and the profile of an optimum frame is 
shown in Fig. 9-5. 

TABLE 9-2 

WEIGHT OF OPTIMUM FRAMES 


Solving for , 


CJb. rad/sM 2000 3000 4000 5000 




1 


y\ **u^ylydx^ 



(9-54) 


By making the obvious choice for A^, Eq. 
9-54 can be written 




bu dx . 


(9-55) 


Optimum 

WeightJb 1.73 2.56 3.59 4.69 



Figure 9-5. Profile of Optimum Frame 

9-13 
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9^ A MINIMUM WEIGHT FRAME WITH 
MULTIPLE FAILURE CRITERIA 

In most structural design problems, con¬ 
straints of several kinds must be treated 
simultaneously. In the present problem, con¬ 
straints on member size, deflection, and 
buckling are enforced in the minimum weight 
design of the frame shown in Fig. 9-6. Area is 





3 





the third member moves - w (L) units to the 
right. Thus, by elementary beam theory (Ref. 
3) 


3w(L)EI^ 


(9-56) 


The differential equation of deformation of 
the first member is 


(£aiw?(^)w"]" = -q(x) (9-57) 


Figure 9-6. Laterally Loaded Frame 


and the boundary conditions are 


allowed to vary along the length of the first 
and second members but the geometrical 
shape of the cross section is fixed. Thus, 

I fix) = a.uj(x) 

where ot^ depends on the cross-sectional geom¬ 
etry and Wy(jc) is the cross-sectional area of the 
/th member at the point x. The size of the 
third member is fixed and all members are 
taken the same length. 

Free body diagrams of the members are 
shown in Fig. 9-7. 

The third member is uniform with constant 
modulus F/a- Further, axial deformation of 
the second member is neglected so the top of 

9-14 


w(0) = 0 


w'(0) = 0 


w'U) = 0 


- {EotiUiw”)* (I) = 


3EIM^) 


} (9-58) 


To get the boundary-value problem, Eqs. 
9-57 and 9-58, into the form of Eq. 9-1, 
define 


Zi = w 

Z 2 = Eoli Ui w" = Eoti u] Zi 


H9-59) 


The boundary-value problem, Eqs. 9-57 and 
9-58, is then 
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■r 1 
■ [:]■» 


(9-60) 


and 


Bz = 


~ 2l(0) 


"0“ 

2'l(0) 


0 

ZiiD 


0 



_0_ 


= <l 


(9-61) 

The equations which determine buckling 
load P of the second member are 


and 

0^ = - ot^ujix) + /o < 0 I = 1,2 (9-67) 

The constraint, Eq. 9-65, requires that the 
axial load T in the second member be less 
than or equal the buckling load. A limit S is 
placed on horizontal deflection of the top of 
the frame in Eq. 9-66. The constraints, Eq. 
9-67, are included to insure that member 
cross section does not go to zero anywhere. A 
more realistic constraint would be on bending 
stress but this would require a constraint of 
the form of Eq. 8 - 6 . This constraint will be 
included in subsequent work but will not be 
treated here. 

The constraints, Eqs. 9-65 and 9-66, do not 
fit directly into the basic formulation of this 
text and require special treatment. The lin¬ 
earized forms of these constraints are 

^ SziiD-SP< A}l/ (9-68) 
and 



and 


~y(or 


"0" 



_o_ 


(9-63) 


The objective in the design problem is to 
choose Ui(x) and U 2 (x) to minimize the 
weight of the first two members. 


y = 7 



lUi(x) U2(x)]dx 


(9-64) 


The constraints to be enforced are 

, /3£/3\ 

=- 0 (9-65) 

\I/2=-Zi(L)-S< 0 (9-66) 


- 5z,(I) < A 02 (9-69) 

It remains to obtain expressions for Szi(L) 
and dP explicitly in terms of du.(x). From Eq. 
9-21, dP may be expressed in terms of 61^2 (x). 

In order to obtain an expression for 
^ZiiL)y a Green’s identity similar to Eq. 9-14 
is needed. Integration twice by parts of 

f \^L6z dx yields 
-'0 



Ldz dx = 


■f 

•'0 


dx 


= (X,6zi=\;6r, 

+ Xj 6z J — Xj 6z I) 1^ 
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Choosing X such that L\ = 0, substituting for 
L6z from Eq. 9-13, and using the linearized 
boundary conditions of Eq. 9-61, this be¬ 
comes 



bz^iL) 


-X,(0)6zi(0) 

+ X;(0)6zj(0) 

+ X,(I)«z,(L) 
-X2(I)6z,(L) 

(9-70) 


The adjoint variable is chosen to satisfy L\ 
= 0 but no boundary conditions have yet been 
specified. Choosing 


— ~ ^2(^) “ 1 
Xi(0) = x;(0) = Xj(i) = o 
the identity, Eq. 9-70, becomes 

6ri( 


(9-71) 


P 2z2(j:)Xj(a:)1 , , 

[- X?(x) J 


(9-72) 


Thus an explicit relationship between bu and 
6zi(L) has been found and may be substi¬ 
tuted directly into the linear constraints, Eqs. 
9-68 and 9-69. With proper choice of nota¬ 
tion, these inequalities fit into the form of 
Eq. 9-33. 


In this problem the differential equations, 
9-16 


Eqs. 9-1 and 9-5, are formally self-adjoint so 
L* “ L, K* - K, and = 3/ in the general 
theory. Since the boundary-value problem, 
Eqs, 9-62 and 9-63, is self-adjoint, y = y and 
the computation required in Step 2 of the 
steepest descent algorithm is considerably 
reduced. 

This problem is now easily put in the form 
of the problem of par. 9-2. It was solved by 
direct application of the algorithm of that 
paragraph with the data 5 = 4 in., I = 100 in., 
E = 3.0x10^ psi, 0 £ = 0.07958, /q = 0.0147 
inf, and area of member 3 is 4.0 in? The 
volume of the optimum frame for several 
values of q is given in Table 9-3. The profile 
of an optimum frame is shown in Fig. 9-8. 


TABLE 9-3 


VOLUME OF OPTIMUM FRAME 


q, Ib/in. 

10 

15 

20 

25 

Optimum 





Volume, 

in? 

502.1 

531.1 

556.0 

653.9 


9-7 A MINIMUM WEIGHT VIBRATING 
PLATE 

In order to illustrate the use of the 
steepest descent method in hi^er dimen¬ 
sional problems, a minimum weight vibrating 
plate problem will be solved. A rectangular 
plate. Fig. 9-9, is specified by its thickness 
function h{xi, Xj) over the plate. The object 
here is to choose h{Xi, X 2 ) such that the 
weight of the plate is as small as possible 
subject to the constraint that the natural 
frequency of lateral vibration is greater than 
or equal to a given frequency o). Further, due 
to applied loads, a constraint of the form is 
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X 2 )- h{Xi, X 2 ) < 0 (9-73) 

enforced. In the present problem, X 2 ) 

is taken as a constant Hq. 

Denoting bending moments (Ref. 4) by>^i 
= Afy, and y 3 = and the lateral 
displacement by = w, the equations 
governing lateral vibration may be written 
(Ref. 4) in self-adjoint form as 


Figure 9-9. Simply Supported Plate 

yi(a, AC2) = 0,yi(0, X2)-0 
y4(a, :^2) = 0,y4(0, X2) = 0 

(9-75) 

y4(xi , 0) = 0, > 4 (^ 1 , ^) = 0 



(9-74) 

where f = po)^ and co is the natural frequency 
of vibration of the plate. The boundary-value 
problem for the simply supported plate with 
differential equations, Eqs. 9-74, is self- 
adjoint with the boundary conditions 


The coordinate system and simply supported 
boundary of the plate are shown in Fig. 9-9. 

To complete the formulation of this prob¬ 
lem in terms of the preceding theory, a cost 
function is defined by 


J= f f lh{Xi.X2)dXxdx2 (9-76) 

•/n 


where 7 is weight density of the plate 
material. The strength constraint in this prob¬ 
lem is taken as Eq. 9-73 and the eigenvalue 
constraint is 

pwo - f < 0 (9-77) 
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This problem is now in the form of the 
general problem of par. 9-2. The domain in 
this case is simply the rectangular region of 
the plate that is of dimension two. Further, 
since the boundary-value problem, Eqs. 9-74 
and 9-75, is self-adjoint, y = y in the theory of 
par. 9-2. 

Using the definition of K and M, <l> = hQ - 
h(xi,X 2 ) in Eq. 9-73, and = pa>S - f in 
Eq. 9-77, from Eq. 9-23, 


(Ref. 5). Data for this problem are a = ^ = 5.0 
in., E = 3.0x10“^ psi, p =7.43x10“^ lb-sec^/ 
in.^, V = 0.30, and w© = 1375 rad/sec. A 
uniform plate with f = pw^ = 1400 was taken 
as the initial estimate. The volume of the 
optimum plate is 10.71 in.^ Double sym¬ 
metry of the optimum plate was observed 
about the axes through (fl/2,^/2). One quarter 
of the optimum plate with contour lines is 
shown in Fig. 9-10. 




(9-78) 


and from Eq. 9-25 




_ r 36(j'j 


- + 18(1 +n)yl 


Eh^ 




dxtdxi 


(9-79) 

This problem is now in the form of the 
general problem of par. 9-2. It was solved by 
direct application of the algorithm of that 
paragraph. The eigenvalue and eigenfunction 
for the variable thickness plate were deter¬ 
mined approximately by the Ritz technique 



Figure 9-10. Contours of Optimum Plate 
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APPENDIX A 

CONVEXITY 


Convex functions and sets as defined in 
Chapter 2 play an important role in optimiza¬ 
tion theory. It is generally possible to obtain 
much more comprehensive results in non¬ 
linear programming problems that are convex 
than in the nonconvex case. Some of the 
more important results due to convexity are 
given in Chapter 4, par. 4-2. 

In order that this appendix is self- 
contained, definitions of general convex sets 
and functions will be repeated here. For a 
complete treatment of convexity, the reader 
is referred to Ref. 1. 

Definition A-1: Let D be a subset of /?'*. D 
is called a convex set if for any points x and y 
in A Jc + B(y - x) is also in £> for all 0 such 
that 0 < 0 < 1. 

The collection of points x + B(y - x), 0 < 
0 < 1, is just a straight line from x to y. Def. 
A-1 just says, then, that a set in/?'* is convex 
if the straight line joining any pair of points in 
the set lies entirely in the set. For example, in 
(the plane) the set of points inside the 
unit circle is convex (see Fig. A-1 (A)) whereas 
the star-shaped region in Fig. A-l(B) is not 
convex. 

Convex functions have as their prototype 
fix) = in /? ‘. The graph of this function is 
shown in Fig. A-2. Note in this figure that if a 
straight line is constructed between any two 
points [z, f(z)] and [y,/C>')),then this line is 
above the graph affix) at all points between z 



Figure A-1, Examples; Convex Case and 
Nonconvex Case 

and y. This is precisely the property which 
characterizes convex functions. Analytically, 
this property is expressed by the inequality 

f{z ’^Biy-z)\< fiz) + BUiy) - fiz)] 

for alio withO< 0 < 1. 

The same idea holds in /?'* where convex 
functions are characterized by 

Definition A-2: Let the real valued func¬ 
tion fix) be defined on the convex subset D 



Figure A-2. Graph of fix) = x^ in 


A-1 
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of /?'*. Then/(jf) is called a convex function if 
for any points z and y in A 

f\z + 00 - z)l < /(z) + 0[/O) -fiz)] 

(A-1) 

for all 6 with 0 < 0 < 1 . 

Convex functions and convex sets are 
related as is shown in 

Theorem A-1: The set of points D in /?" 
which satisfy gjix) < 0 , i= 1 , m, is convex 
if each of the functions^^(jc) is convex in/?”. 

One further property of convex functions 
is extremely important for applications. It is 
established by 

Theorem A-2: If /(jc) is differentiable and 
convex on the convex subset D of /?”, then 

fix) > f(y) ’^VfiyYiz - y) (A-2) 

for all z and y in D, 


All these desirable properties of convex 
functions will go to waste unless one is able to 
test a given function for convexity. The 
following three theorems provide useful tests: 

1. Theorem A-3: If fix) is twice contin¬ 
uously differentiable in a convex subset D of 
/?”, it is convex in D if an only if the 
quadratic form 

(A-3) 

is positive-semidefinite at each point in D. 

2. Theorem A-4: If the functions qjix\ i = 
1, .... r, are convex in the convex subset D of 
/?” and > 0 , / = 1 ,..., r, then 

is convex in D. 

3. Theorem A-5: If gix) is twice contin¬ 
uously differentiable, gix) < 0 , and gix) is 
convex; then, - l/lgix)] is convex. 
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APPENDIX B 

ANALYSIS OF BEAM-TYPE STRUCTURES 


Finite and discrete element methods of 
structural analysis (Refs. 1,3), require a 
knowledge of the behavior of each element in 
the structure. Once each element is described, 
then the governing equations of the entire 
structure may be derived. Energy methods are 
generally used to obtain the governing equa¬ 
tions. 

B-1 ELEMENT ANALYSIS 

In order to apply energy theorems for the 
analysis of a structure, the potential energy 
due to strain, kinetic energy, and change in 
external dimensions due to bending must be 
described. The basic idea is to assign general¬ 
ized displacement functions, which are of the 
form expected in structural deformation and 
that are uniquely specified when the displace- 
.ment of both ends of the beam is known. A 
typical beam with its deformation sign con¬ 
vention is shown in Fig. B-1. The displace¬ 
ment Ui, U 2 » U 3 , and 1/4 are components of 
endpoint displacement and 1/5 and are 
endpoint rotations. 

The longitudinal displacement of a point x, 
0 < X < £, on the beam due to longitudinal 
strain is approximated by 



Lateral displacement of the beam at a point 
X is approximated by 



Figure B-1. Basic Beam Element 


w(x) =-^ (2x* - + «*) —^ (2x® 

-22x* 

(B-2) 

It should be stressed that the longitudinal 
displacement 5 (x) is due only to longitudinal 
strain in the beam and not due to the change 
in length caused by the lateral displacement 
w(x). 

The potential energy PE due to deforma¬ 
tion of the beam is (Ref. 2): 



!*£/ ■^( 12 x- 68 )-|f(i 2 x- 6 £) 


+|f(6x-4£)+|f(6x-2£) \x 
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which is 


# 


2 “ t* 


y48* 

-y48* 

0 

0 

0 

0 

y48* 

y48* 

0 

0 

0 

0 

0 

0 

12 / 

- 12 / 

- 6/8 

- 6/8 

0 

0 

- 12 / 

12 / 

6/8 

6^2 

0 

0 

- 6/8 

6/8 

4/8* 

2 / 8 * 

0 

0 

- 6/8 

6/8 

2 / 8 * 

4/8* 


140 70 0 0 0 0 

70 140 0 0 0 0 

1 -rMfi 0 0 156 54 -228 138 , 

2 420 0 0 54 156 - 138 228 

0 0 - 228 - 138 48* - 38* 

0 0 1 38 228 - 38* 48* 


where w = [mj, M 3 . M 4 , M 5 , m^]^. 


Similarly, the kinetic energy KE of the 
beam is 




+ -r|(2x3 -3£x2 +£3) 


The shortening of the beam A£ due to the 
lateral displacement is 


A8 = 8 


I IdJc 


-/.'['-©I 

tJ[ 




+-j^(3x* -4ex + 2*) 


+^(3x*-2ex) dx 


■^(2x^ -3£x^) 


+ -^(x^ - 2£x* + ex) 


+ ^(x’-ex’) dx 


0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

A 

A 

3 

3 

1 

1 

11 

u 

58 

58 

20 

20 

A 

A 

3 

3 

1 

1 

U 

u 

58 

58 

20 

20 



1 

1 

8 

8 

0 

0 

20 

20 

15 


A 

A 

1 

1 

8 

8 

U 

u 

20 

20 

60 

15 


u (B-8) 
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B-2 VARIATIONAL PRINCIPLES 


y(u) = [/(U) + i2(w) 


(B-9) 


In most structural analysis work it is more 
convenient to use variational principles to 
describe the state of the structure than it is to 
use Newton’s laws directly. For use in analysis 
of the structures considered here, the basic 
variational principles are stated. These prin¬ 
ciples apply to systems that are unconstrained 
in the particular coordinate system chosen; 
i.e., once the coordinates, / = 1,.... n, that 
describe the state of the system are chosen, 
there are no algebraic relationships between 
these variables. Further, it is required that all 
force-state relationships for external or in¬ 
ternal forces are continuous, i.e., small 
changes in state yield only small changes in 
forces. 

Let h'' denote the work done by all forces 
on a structure due to an admissible displace¬ 
ment. Then, the system is called conservative 
if the work h'' done in any displacement that 
returns to the original state is zero; i.e., the 
system is conservative if no energy is required 
to change the state of the structure and bring 
it back to the original state along any path. A 
structure would be nonconservative, for 
example, if sliding friction or viscous damping 
were present. 

Starting from some reference state of the 
structure m©, define the stored energy in the 
structure at state u to be (/(u). Note that 
since the system is conservative, f/(u) depends 
only on u and not on the path the state 
variable followed in getting from Uq to u. 
Likewise, the negative of work done by the 
external forces acting on the structure due to 
the change in state variables will be denoted 
Q(u). Again, ^2(u) depends only on the final 
state and not on the path from Uq to u. With 
this notation, the total potential energy K(m) 
is defined as 


Equilibrium states of a structure can now 
be characterized in terms of the total poten¬ 
tial energy. For unconstrained conservative 
systems, a necessary and sufficient condition 
that u be an equilibrium state is that 


dV(u)= 2 


dV(u) 

du^ 


= 0 


for all This is equivalent to 


dy(u) 

dUf 


0, 1 = 1, ...,n. 


(B-10) 


(B-11) 


This result is proved in Ref. 2, page 23. 

A second result is that a conservative, 
unconstrained system is in stable equilibrium 
at a state u if and only if the total potential 
energy is a relative minimum at u. This result 
is proved in Ref. 2, page 30. For the kind of 
structural systems considered here, it is 
further shown. Ref. 2, page 211, that at a 
buckling load the second variation must 
be positive semi-definite, i.e., 

S* V(u) = Su^ -^(u)6u > 0 (B-12) 


and further that there is a such that 

6 * K(u) = 617^-^(!/)«« = 0. (B-13) 

bu^ 


The inequality, Eq. B-12, shows that zero is 
a relative minimum and Eq. B-13 shows that 
relative minimum is attained for bu = bu. For 
6 ^ V(u) treated as a function of 6 m, then, it is 
necessary that 
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=0, / =1. n 

OOUf 

2/-1 l\dZ / 

‘ "‘{fz) ' ]“ 

at du = Su. Thus, 

where m. are element masses. In the notation 

— (t/)6t/ = 0 (B-14) 

bu^ 

of Eq. B-15, 


""i. [(¥) 

7_ 

The condition, Eq. B-14, determines buckling 
loads for the structural system. All this 

Putting 


analysis requires, of course, that V{u) is at 
least twice continuously differentiable. 

II 

1 

(B-16) 


These laws are for static behavior of the 
structural system. Dynamically, the structure 
is governed by Lagrange’s equations of 
motion. First, define the kinetic energy T as 
the quadratic form 


Lagrange’s equations of motion are simply 
(Ref. 2, page 239) 


dt 



n . I 

— = 0,1 = 1 , 

aUf 


n. 


(B-17) 



B-3 EQUATIONS OF STRUCTURAL 
(B-15) ANALYSIS 


where 
M = [m^] 

and the are generalized masses. The 
generalized mass matrix M is defmed by the 
transformation 


The given variational principles may be 
applied to obtain the governing equations of 
structural analysis. The element properties 
described by Eqs. B-4, B-6, and B-8 may be 
used to generate the potential and kinetic 
energy of the entire structure. From the 
definitions, Eqs. B-9 and B-15, 


z ^f{Z) 
where 

Z= [Zi,...,Z^l 

and Z^ are components of physical displace¬ 
ment of the masses of the structure. There¬ 
fore, 



Z? 




B-4 


K = Z - £ (M* + jj*) F* (B-18) 
y * 


and 

T^XKE^ (B-19) 

/ 

where superscript / denotes theyth element of 
the structure, k denotes the components of 
displacement at joints, is the component 
of external force corresponding to m*, andu* 
is the displacement in the same direction as 
u* but due to the components of bar 
deformation. The displacements will be 
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determined from structure geometry and the 
of individual members given in Eq. B-S. 
For determining the equilibrium equations 
B-11, the displacements are generally 
neglected since they are at least quadratic in u 
so they will be small if no buckling occurs. It 
is just these quadratic terms, however, that 
predict buckling behavior of structures. 

If a composite displacement vector u is 
formed from the components of all the 
member displacements, then matrices K and 
M may be defined by 


2 2 / 
and 

2 2 / 

where summation is taken over all elements of 
the structure and and are defined in 
Eqs. B-4 and B-5. The equations for displace¬ 
ment, buckling, and dynamic motion may 
now be determined directly from Eqs. B-11, 
B-14, and B-17. 
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